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A method implemented by a computer- 
based electronic assistant to receive and manage 
incoming calls to a subscriber including the steps 
of receiving an incoming call to the subscriber 
from a caller, establishing a first connection be- 
tween the electronic assistant and the caller, es- 
tablishing a second connection between the elec- 
tronic assistant and the subscriber; over the second 
connection, electronically notifying the subscriber 
of the incoming call; in response to receiving a 
call accept command from the subscriber over the 
second connection, linking the caller and the sub- 
scriber so that they may communicate with each 
other upon linking the subscriber to the caller, 
switching the electronic assistant to a background 
mode in which said electronic assistant continues 
to monitor the subscriber over the second con- 
nection while the subscriber is linked with the 
caller, and in response to receiving a summoning 
command, switching the electronic assitant into a 
foreground mode, wherein the electronic assistant 
when in its background mode responds to a first 
set of commands including at least the summon- 
ing command and when in its foreground mode 
responds to a second set of commands where the 
second set of commands is larger than the first set 
of commands. 
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A NETWORK BASED KNOWLEDGEABLE ASSISTANT 
Background of the Invention 
5 The present invention relates to a computer- 

implemented method and apparatus for managing communications 
to and/or from a user over multiple media* 

Today there are many different, commercially 
available devices that enable people to communicate with 

10 each other electronically. In addition to the ubiquitous 
telephone that has been around for decades, there now are 
cordless phones for the home, mobile phones for the car, 
handheld wireless phones which fit into a person's jacket 
pocket, pagers, . local and wide area computer networks, and 

15 facsimile machines, to name a few. Undoubtedly, the number 
and type of devices and their sophistication will continue 
to increase over time. Indeed, it is likely that a day will 
soon arrive when it will be possible for everybody to 
conveniently and inexpensively be within arms reach of some 

20 communication device that enables them to communicate 
electronically with other people. 

The proliferation of different types of 
communication devices and the increasing diversity of 
communications media present new challenges. How will 

25 communications among the different devices and over the 

different communications media be coordinated and managed so 
that people have truly effective access to each other? One 
challenge is associated with communicating information 
between and across different communications media. Another 

30 challenge is related to handling the inevitable increase in 
the number of calls so as to maintain accessibility of 
users. For example, as more people come to rely on their 
wireless phones to transact business while on the road or 
away from their offices, their phones are likely to be busy 
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a larger percentage of the time. As a consequence, although 
a wireless phone can go anywhere with its owner, to the 
people trying to reach that owner when the phone is in use, 
the owner will still seem to be as inaccessible as when he 
5 did not carry a wireless phone. In addition, the more the 
owner of such a device uses it, the more likely it will be 
that he will not know that somebody else was trying to reach 
him and thus he may miss important calls. 

An obvious advantage of many of the new commercially 

10 available communications devices is that they offer the 

possibility of greater mobility to the user. Unfortunately, 
however, it is not always having to be near the office 
telephone that ties a business person to the office. The 
office provides other services that are also important and 

15 may not be so mobile. Thus, to fully realize the greater 
mobility that is offered by the new communications devices 
and media, these new technologies must be provided in a way 
that takes into account the business person's dependance on 
other services besides communications. 

20 summary of the Invention 

The invention described herein is referred to as an 
electronic assistant. It is a computer- implemented entity 
that assists a subscriber with his or her communications by 
carrying out tasks that are delegated to it. The electronic 

25 assistant, modeled to have human-like qualities, recognizes 
speech and performs functions within the familiar model of 
an office. 

Each subscriber who has an account is assigned at 
least one electronic assistant which may be dedicated or 
3 0 shared. The electronic assistant offers services to both 
the subscriber and to any contacts or other callers trying 
to reach the subscriber. A contact is a person, place, or 
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group that the subscriber has described for the electronic 
assistant. A contact can be another subscriber or an 
outside caller. Electronic assistants treat both 
subscribers and contacts as users of the system. 
5 The electronic assistant offers a wide range of 

services to its subscriber, among which are the following. 
The electronic assistant can handle incoming calls from 
several of the subscriber's personal contacts while at the 
same time it is doing any of its tasks, including reviewing 

10 messages with the subscriber, managing information, etc. It 
can make logical decisions about how to manage, whether to 
forward, and where to forward the incoming calls from the 
subscriber's contacts. The electronic assistant can also 
communicate with the subscriber and the subscriber's 

15 contacts using a wide variety of different communication 

devices, such as telephones, fax machines, pagers, computer 
terminals, and communications enabled handheld devices (e.g. 
Personal Digital Assistants otherwise referred to as PDA's) . 
When the subscriber tries to reach a contact or a contact 

20 tries to reach the subscriber, the electronic assistant 
mediates the connection and then remains available to add 
value to the session. The electronic assistant can schedule 
and manage reminders for its subscriber. When reminders 
come due, the electronic assistant notifies the subscriber. 

25 In general, in one aspect, the invention is a method 

implemented by a computer-based electronic assistant to 
receive and manage incoming calls to a subscriber. The 
method includes the steps of: receiving an incoming call to 
the subscriber from a caller; establishing a first 

30 connection between the electronic assistant and the caller; 
establishing a second connection between the electronic 
assistant and the subscriber; over the second connection, 
electronically notifying the subscriber of the incoming 

- 3 - 
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call; in response to receiving a call accept command from 
the subscriber over the second connection, linking the 
caller and the subscriber so that they may communicate with 
each other; upon linking the subscriber to the caller, 
5 switching the electronic assistant to a background mode in 
which said electronic assistant continues to monitor the 
subscriber over the second connection while the subscriber 
is linked with the caller; and in response to receiving a 
summoning command, switching the electronic assistant into a 
10 foreground mode. When in the background mode, the 

electronic assistant responds to a first set of commands 
including at least the summoning command and when in the 
foreground mode, it responds to a second set of commands. 
The second set of commands is larger than the first set of 

15 commands. 

In general, in another aspect, the invention is a 
computer- implemented method of processing an electronic 
reminder that is addressed to a subscriber. The electronic 
reminder includes subscriber-generated content and a 

20 specified time at which it is to be delivered to the 

subscriber. The method includes the steps of: storing the 
electronic reminder in an electronic database that is 
accessible to the electronic assistant; when current time 
coincides with the specified time, detecting that the stored 

25 electronic reminder has become due; in response to detecting 
that the stored electronic reminder has become due, 
identifying a communications device through which the 
subscriber can be reached at the specified time; 
establishing a connection to the communications device; upon 

3 0 reaching an answering party through the communications 

device, electronically notifying the answering party that 
the call is intended for the subscriber; electronically 
informing the answering party that the answering party may 
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accept the call by issuing an accept reply; if the call is 
accepted by the answering party, electronically delivering 
the contents of the electronic reminder to the answering 
party through said communications device. 
5 In general, in yet another aspect, the invention is 

a method implemented by a computer-based electronic 
assistant to receive and manage incoming calls to a 
subscriber. The method includes the steps of: receiving an 
incoming call to the subscriber from a caller; in response 

10 to receiving the incoming call, establishing a first 

connection between the electronic assistant and the caller; 
through a dialog between the electronic assistant and the 
caller over the first connection, determining the identity 
of the caller; detecting that the subscriber is presently 

15 interacting with the electronic assistant through a second 
separate connection; electronically alerting the subscriber 
over the second connection that there is an incoming call 
for the subscriber; electronically identifying to the 
subscriber the identity of the caller; monitoring the second 

2 0 connection for a response sent by the subscriber to the 

electronic assistant directing the electronic assistant how 
to process the incoming call. 

In general, in still another aspect, the invention 
is a method implemented by a computer-based electronic 
25 assistant for managing information and connection resources 
for a plurality of subscribers including a first subscriber 
and a second subscriber. The method includes the steps of: 
receiving a call from the first subscriber to the electronic 
assistant over a communications media; establishing a first 

3 0 connection between the electronic assistant and the first 

subscriber; identifying the first subscriber as the source 
of the call; starting up a first session in said electronic 
assistant which is a thread of execution of code for 
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managing data and performing functions on behalf of the 
first subscriber; within the first session, receiving a 
first command sent by the first subscriber to the electronic 
assistant over the" first connection instructing the 
5 electronic assistant to perform a function relating to the 
second subscriber; responding to the first command by 
sending a first message addressed to the second subscriber 
and containing information relating to the first command; in 
response to the first message, starting up a second session 

10 which is a thread of execution of code for managing data and 
performing functions on behalf of the second subscriber and 
which is separate from the first session; within the second 
session, receiving the first message and performing a 
function that produces a result that is responsive to the 

15 first message. 

In general, in another aspect, the invention is a 
computer- implemented method of processing communications 
through a multimedia interface that includes a plurality of 
interface devices and a plurality of input /output devices. 

20 Each of the interface devices is capable of connecting to a 
different one of a plurality of different communications 
networks, and each of the input/output devices is capable of 
processing a different one of a plurality of media types* 
The method includes the steps of: establishing a channel 

25 representing a physical connection to any selected one of 
the plurality of communications networks through the 
interface devices; attaching an appropriate subset of a 
plurality of ports to the channel, wherein each port 
represents a different one of the input/output devices and 

30 wherein the appropriate subset of ports includes ports which 
correspond to input/ output devices that are capable of 
connecting to the selected communications network; executing 
an operation that generates an item of information that is 

- 6 - 
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to be communicated through the multimedia interface to at 
least one of the communications networks; retrieving from a 
memory a multi-media reference to the item of information, 
wherein the multi-media reference contains a plurality of 
5 references to the item of information, each of which refers 
to the item of information in a different one of a plurality 
of formats, each of which is of a different media type; 
passing the multi-media reference to the ports attached to 
the channel; in response to receiving the multi-media 

10 reference at the attached ports, retrieving the item of 

information from memory in a particular one of the formats 
identified in the multi-media reference; and passing the 
retrieved item of information to an input/output device for 
delivery over the connected communications network. The 

15 step of retrieving is performed by one of the attached ports 
that is capable of processing the format of the retrieved 
item of information and the input /output device to which the 
retrieved item is passed for delivery is the input/ output 
device that is associated with the attached port that 

20 retrieved the item of information from memory. 

In preferred embodiments, the electronic assistant 
enables the individual to manage and customize his 
availablilty to friends, family, business associates, 
customers and strnagers depending upon the time of day, day 

25 of the week and his or her needs. In addition, the 

electronic assistant has the subscriber's schedule and 
therefore knows where the subscriber is, what he is doing, 
what his availability is, and how to reach him (e.g. through 
what communications device) . The schedule is used to manage 

30 the accessibility of the subscriber to others and his 
visibility to other subscribers on the system. 



- 7 - 
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Other advantages and features will become apparent 
from the following description of the preferred embodiment 
and from the claims. 

R yief Des cription of the Drawings 
5 Fig- 1 shows the electronic assistant and office 

items ; 

Fig. 2 shows the hardware platform; 
Fig. 3 is a functional overview of the software 
architecture ; 

10 Fig. 4 is a process view of the software 

architecture; 

Fig. 5 is a functional block diagram of the system 
highlighting the hardware interface to the communications 
channels ; 

xs Figs. 6A-H illustrate the process of completing a 

task and the objects that are involved; 

Fig. 7 illustrates the use of the box, the hand and 
the finger in manipulating items; 

Fig. 8 shows a sample MMUI menu; 
20 Fig. 9 shows a sample MMUI menu with moguls; 

Figs. 10A-D show the relationship between memes, 

menus and moguls; 

Fig. 11 illustrates the process of submitting and 
delivering a message and the objects that are involved; 
25 Fig. 12 illustrates the process of delivering 

reminders and the objects that are involved; 

Fig. 13 shows the contents of a box; 

Fig. 14 shows the contents of a user object; 

Fig. 15 shows the relationship between user objects 

3 0 and contacts; 

Fig. 16 shows the flow of events between the VM and 
the agent sessions and hardware; 

- 8 - 
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Fig. 17 shows the VM internal objects and their 
relationship to each other; 

Figs. 18A-B illustrate the process of creating a 
channel and the objects that are involved; 
5 Fig. 19 illustrates the operations for presenting 

nenes and menus to ports; 

Fig. 20 shows the functional relationships between 
the agents and the applications services; 

Fig. 21 shows the flow of data in an assistant's 
10 session; 

Fig. 22 shows the agent class hierarchy; 

Figs. 23A-D present an example of a three-part 
handshake ; 

Figs. 24A-B is a flow chart of the answer call task; 
15 Figs. 25A-D illustrate the use of a conference 

object in establishing a call between subscribers; 

Fig. 26 is a flow chart of the locate-and-notif y 

task; 

Fig. 27 is a flow chart of the notify task; 
20 Fig. 28 shows a M Create-A-Contact" dialog with the 

electronic assistant; 

Fig. 29 shows a M Remind-Me M dialog with the 
electronic assistant; 

Fig. 30 shows a "Remind-Me-To-Call" dialog with the 
25 electronic assistant; 

Fig. 31 is a flow chart of the handle reminder task; 

Fig. 32 shows a "Find" command dialog; 

Fig. 33 shows an "I-Will-Be" dialog; and 

Fig. 34 shows a M Create-An-Itinerary" dialog. 

30 Description of the Preferred Embodiments 

The Virtual Office: 

- 9 - 
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As illustrated by Fig. 1, a subscriber works with an 
electronic assistant 10 using a "virtual office" as a model* 
The electronic assistant 10 works in an office containing 
the subscriber's objects, which are called "items". An item 
5 is a piece of information that the electronic assistant 

stores in a database and works on for the subscriber. The 
subscriber can use spoken or touch-tone commands to have the 
electronic assistant work on various items, and the 
electronic assistant then uses a dialog to gather the 
10 information it needs from the subscriber to complete the 
task. 

An item may be any one of the following: a schedule 
12, a contact 16, (e.g. a person 18, a place 22, or a group 
20), a message 14, a reminder 24, a phone book 26, or trash 
15 28. Messages can include other items, such as a contact, 
reminder, page, etc. In derivative implementations, a 
message could be any sort of multimedia or composite 
information. 

A "schedule" lists where the subscriber can be 

2 0 reached and at what time. There is a default schedule and 

an override schedule. The default schedule is the 
subscribers daily or weekly schedule; it is entered by a 
system administrator. Subscribers can override the default 
schedule with a »I-Will-Be" or "Create-an-Itinerary" 
25 commands to create an override schedule (see Exhibit A at 

the end of the specification for a list of spoken and touch- 
tone commands that an electronic assistant implements) . The 
"I -Will-Be" command enables the subscriber to indicate when 
and where he can be reached and his availability at those 

3 0 times. 

A "contact" is the subscriber's view of a person, 
place, or group. The subscriber can use a contact as the 
recipient of a command, such as placing a call or sending 

- 10 - 
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voice mail. A contact is analogous to an entry in an 
address book file. An "outside contact" is a contact that 
is not a subscriber* 

A "person" is a representation of a person, which 
5 describes another subscriber or someone outside the system. 
The description includes the spoken name, spelled name, 
priority, gender, and a list of places where the person can 
be reached. 

A "place" is a location with a single address (e.g. 

10 a phone number, fax number, network id, etc.) associated 

with it. Each person can include a standard set of places: 
work, home, car, mobile, and other. A corporation, 
department, or other organization is also considered a 
place, and can include a phone number and fax number. 

15 A "group" is a user-defined set of persons, places, 

and/ or groups. 

A "message" is a piece of information that is 
addressed to a person or group. The most common type of 
message is voice mail* 

20 A "reminder" is a notice to be delivered at a future 

time. The reminder can be a notice to call a contact about 
a particular subject ("call reminder") or a notice 
containing a recorded audio message about another subject 
("recorded reminder") . 

25 A "phone book" is a list of other system users that 

is published for easy access by subscribers. 

The "trash" is a collection of all the items that 
the subscriber has thrown away. 

In the described embodiment, the electronic 

3 0 assistant can recognize specific words or phrases, which are 
called utterances. To recognize a subscriber's speech, the 
electronic assistant gathers training utterances from the 
subscriber, which are recordings of the way the subscriber 

- 11 - 
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pronounces a word or phrase. The process of gathering -these 
training utterances is sometimes referred to as training in 
that the subscriber is training the electronic assistant to 
recognize his voice. For example, the electronic assistant 
5 may ask the subscriber to say his full name several times or 
the name of a command several times. These utterances are 
then compiled into a compressed format known as a 
vocabulary. A "vocabulary" is a finite set of recorded 
words and phrases that can be used directly by the system's 
10 speech recognition hardware to recognize the subscriber's 
speech . 

The electronic assistant uses two different kinds of 
vocabularies, namely, a speaker-dependent vocabulary and a 
speaker- independent vocabulary. Ttie "speaker-dependent 

15 vocabulary is used to recognize the speech of users who have 
explicitly trained the electronic assistant with their own 
pronunciations. The speaker- independent vocabulary is used 
to recognize a multitude of different voices without 
requiring each user to train the system. 

20 In addition to these different kinds of 

vocabularies, the electronic assistant uses two different 
methods to recognize speech, namely, continuous recognition 
and discrete recognition. Continuous recognition recognizes 
naturally spoken words or sequences of words, that is, words 

25 without artificial pauses between them. In the described 
embodiment, there are two vocabularies recognized with 
continuous recognition: numbers (the digits "one" through 
"nine", "zero", and "oh") and yes/no ("yes" and "no"). 
Continuous recognizers also discriminate against words like 

30 "urn" and "ah". Discrete recognition recognizes discrete, 

isolated words or phrases, or sequences of distinctly spoken 
words that are separated by pauses . 



MSDOCID: <W0 961 1542A2J_> 



WO 96/11542 



PCT/US95/1 1737 



The electronic assistant responds to a variety of 
commands (see Exhibit A) . Some commands operate on the 
current context in the hand (described later) , while others 
do not. Some commands start a dialog between the electronic 
5 assistant and the subscriber so the assistant can gather the 
information it needs to carry out the task. For example, to 
call a contact, the subscriber can use the "Call" command: 

Subscriber Assistant 

Call Call Whom? 

10 Bill Bishop Where? 

Work Dialing. . . 

Functional Description of the User Interface 

Before going into the details of the hardware and 

the software architecture of the system, a description of 
15 how the system handles an incoming call to a subscriber will 

first be presented. This will provide the context for then 

describing the underlying mechanisms that are implemented 

within the system to achieve their functionality. 

Throughout the remainder of this specification 
20 except where noted, it will be assumed that the subscriber 

is named John Smith and the caller is named Bill Bishop. 

When a caller calls into the system in an attempt to 

reach a particular subscriber (i.e., by calling a telephone 

number that is assigned to that subscriber), the system 
25 answers the call. By playing back audio signals that are 

stored in the system's database, the system announces to the 

caller: 

Good Morning, I'm the electronic assistant for John 
Smith. Please say your full name. 

3 0 The caller replies by stating his name: 

Bill Bishop 

- 13 - 
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The system records the caller's utterance and using a 
speaker dependent dictionary that is stored within the 
system's database, it attempts to recognize the caller. If 
the system cannot recognize the caller's name, it responds 
5 by playing the following message: 

Using touch- tones, enter your area code and phone 
number followed by the # key. 

The user enters his telephone number as requested. The 
system using decodes the touch tones and then searches a 
10 contact list for the subscriber to find a contact with that 
telephone number. 

If the system succeeds in recognizing the caller on 
the basis of his phone number, it then plays the following 
message to the caller: 

15 Would you like to take a moment to teach me how to 

recognize your name better? 

The caller may accept this offer by pressing w 9" on his 
touch-tone key pad. The system then responds by playing the 
following message: 
20 Please repeat your full name. 

After repeating this operation a second time, the system 
stores the vocalizations with the identity of the contact. 
The next time the caller contacts the system, it will use 
the stored vocalization of the caller's name to recognize 

25 the caller. 

Once the assistant either recognizes the caller 
either through a match with a stored vocalization or through 
the caller's phone number or labels the caller as unknown, 
it then attempts to locate the subscriber. It does this by 

3 0 carrying out a sequence of operations the first of which is 
to check the subscriber's status. If the subscriber 

- 14 - 
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currently has a connection established with his assistant 
(and he has not enabled a do not disturb function) , then his 
status is available. If the subscriber is not connected, 
then the assistant may check a secondary information source 
5 (such as a cellular network) to determine the subscriber's 
availability. Finally, the assistant will check the 
subscriber's schedule. The subscriber can set his 
availability to indicate that he is accepting all calls, he 
is accepting no calls, or he is accepting only important 
10 calls. 

If the subscriber is not accepting any calls, the 
system plays the following message to the caller: 

Sorry, he's not available. Please leave a message and 
then hang up or press the w # w key. Recording. 

15 After the caller has finished, the system may send a non- 
interactive notification to the subscriber that the caller 
has just tried to reach him. It does this by, for example, 
sending a page through its pager interface or sending an E- 
Mail message to the subscriber's workstation. The 

20 notification identifies the caller and it indicates whether 
voice mail was left. 

If the subscriber is accepting only important calls, 
the system checks the subscriber's contact list to determine 
what the caller's priority is. The subscriber can designate 

25 the contact as either high priority or normal priority and 
this information is stored with other information about the 
contact. If the caller has only normal priority, the system 
reports that the subscriber was not available and offers the 
caller the option to leave voice mail, as previously 

3 0 described. On the other hand, if the caller has high 

priority, the system continues its attempt to locate and 
notify the subscriber of the call. 

- 15 - 
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In the event that the subscriber is accepting all 
calls, the system continues its attempt to locate and notify 
the subscriber of the call without regard to the caller's 
priority designation. 
5 As a first step in locating the subscriber, the 

system determines whether the subscriber is already 
connected to the system, either through another call or 
through some other communications medium (e.g. logged into 
his computer) . If the subscriber is on another call being 

10 handled by the system, the system briefly interrupts that 

call to notify the subscriber that he has a call waiting and 
it identifies the name of the caller. If the caller is also 
logged onto the system through his computer, the system may 
also send a visual message to the workstation notifying the 

15 subscriber of the call and identifying the caller. 

The subscriber then has the option of accepting the 
call, asking the system to place the caller on hold while he 
completes his present call, or asking the system to take a 
message. 

20 If the subscriber accepts the call, the electronic 

assistant responds by immediately establishing a connection 
between the caller and the subscriber. If the subscriber 
instructs the system to take a message, the system offers 
the caller the voice mail option previously mentioned. If 

25 the subscriber instructs the system to place the incoming 

call on hold, the system informs the caller that subscriber 
will be with him shortly. When the subscriber has completed 
his other call, using spoken commands, he instructs the 
system to then establish a direct connection with the new 

3 0 caller. 

If the system determines that the subscriber is not 
presently on a call but is connected to the system or 
reachable through his workstation or other two-way data 
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device, it may send a message that is displayed on the 
device's display screen. The message includes a menu which 
offers the subscriber the option to accept or not accept the 
call. If the subscriber accepts the call, the system 
5 notifies the caller that the subscriber will be with him 
shortly. In the meantime, the subscriber calls into the 
system over another phone and the system connects him with 
the caller. 

If the subscriber does not accept the call or does 

10 not respond within some predetermined period of time, the 

system notifies the caller that it was unable to locate the 
caller and offers the caller the option to leave voice mail. 

If the electronic assistant does not detect the 
presence of a subscriber on the system, the electronic 

15 assistant checks whether his location is indicated on one or 
more schedules that the system keeps for the subscriber. If 
there; is a schedule that places the subscriber at a 
particular location at that moment and there are phone 
numbers identified with that location, the electronic 

2 0 assistant places a call to one of the phone numbers. 

Additionally, the assistant may check one or more additional 
sources of subscriber location information (such as cellular 
network databases) . 

If a party answers the call, the electronic 

25 assistant plays the following message to the answering 
party : 

Hello, I'm trying to reach Jim Smith. If he is 
available, press the 9 key. If he is not available, 
press the 6 key or hang up. 

30 If the party answering the call indicates that Jim Smith is 
available, the electronic assistant then says: 
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There is a call from Bill Bishop. Do you want to take 
the call? Indicate Yes by pressing the 9 key, indicate 
No by pressing the 6 key. 

If the party indicates that they will accept the call, the 
5 electronic assistant connects the caller to the subscriber. 

In the event that electronic assistant is unable to 
establish a connection with the subscriber, the electronic 
assistant may send a non-interactive notification to the 
subscriber indicating that Bill Bishop has called him at a 
10 specified time. 

Hardware architecture 

Fig. 2 shows the basic hardware components of the 
described embodiment. The system consists of a high- 
performance 486 computer equipped with an ISA bus 40 with a 

15 passive backplane. The computer includes a CPU card 42 and 
display adapters (not shown) . The passive backplane is a 
standalone bus that is not part of the CPU card (i.e., the 
motherboard) . A set of ISA adapters (not shown) plug into 
the passive backplane to form 486 computer system. 

20 Interface cards 44 and the CPU card connect directly to the 
ISA bus. The passive backplane can hold up to 20 interface 
cards . 

The interface cards are special-purpose cards to 
support many different forms of connectivity and 

25 communication. They include network cards to connect with 
standard digital telephone lines as well as special-purpose 
adapters for recognizing speech, making phone calls, and 
sending and receiving faxes, etc. 

A Multi-Vendor Integration Protocol (MVIP) bus 46 

3 0 consisting of a 40-pin ribbon cable is connected to all of 
the interface cards providing telephone services. The MVIP 
bus is a high-speed communications channel that carries all 
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audio traffic between interface cards and switches telephone 
lines. 

Finally, the system also has fixed and removable 
storage including a set of high-capacity, high-speed disk 
5 drives 48 and a floppy drive 50. 

A base system, supporting 8 ports, has 2GB of disk 
space and 32MB of memory with an additional 1GB of disk 
space and 8MB of memory for each additional set of 8 ports. 
The described embodiment, supports a maximum of 24 ports. 

10 In the described embodiment, the following specific 

hardware is used. The line interface cards are either 
Natural Microsystems DTI-48 Tl cards supporting connection 
of two Tl trunks or Voice Technology Group Voice Bridge PC 
PBX cards supporting 8 lines of PBX station set emulation. 

15 The line processing cards are Natural Microsystems AG-8 
cards, each supporting 8 telephone channels. The ASR 
daughter cards are Natural Microsystems DB-31's and there is 
one DB-31 card for each AG-8 line processing card. The 
daughter card performs speech recognition on names only. 

20 Another ASR card which is provided is a VPro-84 from Voice 
Processing Corporation. There is one VPro-84 card for each 
AG-8 line processing card. The VPro-84 card performs speech 
recognition on commands and digits and supports up to 8 
discrete recognizers or 4 continuous recognizers. 

25 The MVIP bus supports up to 256 full-duplex 

telephone connections. The 256 full-duplex connections are 
time-division multiplexed (TDM) so that only 32 separate 
signals are transmitted, with each signal divided into 8 
time slots. Interconnections between two communications 

3 0 channels is accomplished by enabling through software 

control each of the channels to have access the time slots 
of the other channel. That is, the input of each channel is 
permitted to listen to the output of the other channel. 
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Other Possible Hardware Configurations 

The description above presents one possible 
architecture which concentrates a number of hardware 
elements within a single computer chassis, including 
5 multiple special purpose interface cards connected together 
with a special purpose bus. Many other hardware 
organizations could also be used to support the 
functionality described. The basic abstract elements needed 
are: (1) one or more basic computer resources to support the 

10 program and data as described; (2) support for one or more 

incoming communications channels and support for receipt and 
generation of connections on the respective channels; and 
(3) a switching resource to switch multiple communications 
channels together 

!5 it is possible for these resources to be distributed 

across multiple systems. For example, in one possible 
implementation a separate switch resource could be connected 
to computers which contain hardware for managing 
communications channels. These computers which manage the 

20 communications channels are then connected via a network to 
larger systems which run the applications and provide 
database services. The applications and database services 
could be further split up across multiple systems. 

In yet other hardware configurations, one might wish 

25 to consolidate the resources even more than is described for 
the present embodiment. For example, a single computer 
could have a plug in card, or support on the mother board, 
- to handle the communications channels. Switching could be 
done between these channels in hardware or software. In 

30 this way the applications and database would run on a single 
computer with the necessary hardware support to manage all 
necessary communications channels. 
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Software architecture 

This following description presents two different 
views of the system's software architecture: one emphasizing 
functional components and the other emphasizing processes 
5 and events. 

Functional View: 

Fig. 3 shows the high-level software architecture of 
the system. In this and subsequent illustrations of 
processes, the structure objects are represented 

10 symbolically by icons which have an appearance relating to 
the function of the entity being represented. 

The system includes four primary components. One 
primary component includes assistants 60 and agents 62. The 
assistants carry out tasks on behalf of users and the agents 

15 carry out tasks on behalf of assistants. Another primary 
component is a communication mechanism including a 
Multimedia User Interface (MMUI) 64 and a parcel mechanism 
66. These allow assistants and agents to communicate with 
other system components. A third primary component is an 

20 object database 68 which stores user information, such as 
contacts and messages , and system information, such as 
prompts. And the fourth primary component is a Virtual 
Machine (VM) 70 which services requests from assistants and 
agents as well as hardware devices. The arrows in Fig. 3 

25 show the primary paths for the flow of interactions between 
the various components. 

The described embodiment uses the Univel UnixWare™ 
operating system 72 which is based on the UNIX System V 
Release 4.2 operating system. The object database is the 

30 ObjectStore™ object-oriented database by Object Design, 
Inc. 
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Other Possible Software Configurations 

In the described embodiment, the VM, Assistants, 
Agents and Database all reside on one host computer. There 
is nothing in the architecture, however, that necessitates 
5 this. Other implementations could separate these components 
and have them run on separate computers using the previously 
mentioned remote procedure calls to communicate between the 
different processes. 

Process View 

10 Fig. 4 summarizes the process architecture of the 

system. The system consists of a Virtual Machine service 
process 80 (vmserver) and one process 82 for each instance 
of an electronic assistant or agent. Each session process 
communicates with the vmserver using RemoteProcedureCalls 

15 (RPCs) 84. The vmserver services these RPCs from the 

session processes and also services events 86 from hardware 
devices 88. 

The Interfaces to the Communications Media: 
Fig. 5 is a block diagram view of the system 

20 highlighting the hardware interface to various 

communications channels. A central office 90 which receives 
incoming calls from telephones 92 and fax machines 94 
forwards them over a Tl line to a Tl interface card 96 in 
the system. The Tl interface cards are connected to the 

25 MVIP bus 46 along with other cards including a line 

processor card 98, a speech recognizer card 100 and a fax 
board card 102. The line processing card does coding and 
decoding of speech, i.e., it plays audio that is stored in 
the database and records audio for storage in the database. 

30 It also decodes touch-tone signals (DTMF) . The speech 

recognition card performs the speech recognition function 
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using a vocabulary supplied to it from the database. The 
vocabulary might be a speaker dependent vocabulary generated 
by the user or a speaker independent library. 

There may be one or more Tl lines coming into the 
5 system. It is likely, however, that there will be many more 
phone numbers assigned to the system than there are Tl 
lines. Numbers are not mapped to particular lines. Rather, 
when the central office receives a call intended for one of 
those numbers, it simply selects from among whatever lines 

10 are available at the time and sends a ring signal to the 

system over the selected line. When the interface card for 
that line answers the incoming call (i.e., connects to the 
line) , it receives from the central office a sequence of 
touch tones (four digits) identifying the extension that is 

15 being called. Using this four-digit sequence identifying 
the extension and the subscriber's numbers stored in its 
database, the system is able to identify the subscriber for 
whom the call is intended. 

In the described embodiment, the system can also 

20 establish connections to a Wide Area Network (WAN) or a 

Local Area Network (LAN) 104 through an ethernet card 106. 
In addition, it can establish connections to various other 
communications devices through one or more serial interface 
cards 108. In Fig. 5, serial card 108 has two serial ports, 

25 one of which is connected to a SkyTel w system 110 over one 

serial line and the other of which is connected to a two-way 
modem 112 over another serial line. The SkyTel 1 - system 110 
transmits one way communications to pagers 114. The two-way 
modem 112 exchanges two-way communications with wireless 

30 devices 116 such as PDA's (personal digital assistants). 
Fig. 5 also presents a logical view of the 
mechanisms which the system uses to connect to 
communications channels through the interface cards. To 
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connect the system to different communications media, the 
system establishes various channels or communications paths 
including a phone line channel 120, a fax channel 122, a TTY 
line channel 124, a two-way data line channel 126, and a 
5 batch pager channel 128. For each channel, there is a set 
of ports that can be attached to it. The ports, which are 
represented in software by port objects, refer to 
input/output devices supported on the interface cards. 

In the software, each of the line channels is 

10 represented by an object, i.e., a data structure which 
identifies the physical line to which that particular 
channel maps. The data structure also includes a list of 
the ports which can be attached to the channel. Each port 
is represented by a port object and is supported by the 

15 functionality found on the interface cards that are 

connected to the system. That is, each port represents a 
logical digital signal processor on the particular interface 
card which implements the functionality associated with that 
port- 

20 Fig. 5 shows the ports that can be attached to the 

various channels. The phone-line channel can have a DTMF 
port 130, an audio-in port 132, an audio-out port 134, and 
an ASR (automatic speech recognition) port 136. A fax 
channel can have a fax-in port 138 and a fax-out port 140. 

25 Both the TTY line channel and the two-way data line channel 
can have a text-in port 142 and a text-out port 144. The 
batch pager channel can have a text-out port 144. 

For the phone line channel, the DTMF port represents 
the capability of receiving and interpreting DTMF signals 

3 0 sent to the system over the Tl line by the caller. The 

audio-out and audio-in ports represent the capability on the 
line processor cards to both generate and record audio over 
the communications channel. The ASR port which is 
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implemented by the speech recognition card performs the 
automatic speech recognition on the audio. 

Note that throughout this description the term 
"call" is used in its most general sense. Not only does it 
5 include a call placed over the telephone lines but it also 
includes the initiation of any contact over any of the other 
communications media including wireless communication 
channels, computer networks, fax channels, etc. Thus, the 
concept of a call is not meant to be limited only to a 

10 telephone call. 

While the described embodiment includes a limited 
number of channels and ports, the architecture can be 
expanded to handle new channels or ports providing for 
future forms of connections and capabilities* 

15 The four primary software components will now be 

described in greater detail. 

Assistants and agents 

The system is designed to support many different 
kinds of agents. All agents and assistants are based on a 
20 generic agent object. Before exploring the specific agents 
used in the system, this generic agent will first be 
described. 

Agents, sessions and gadgets: 

An agent is a software entity that performs an 
25 action or brings about a certain result on behalf of a user 
or another agent. To communicate with users, an agent 
engages in dialogs using gadgets. A gadget is a 
representation of a communications device, such as a phone, 
fax machine or pager. A gadget includes a description of 
3 0 the communications device as well as its address. For 

example, a phone gadget contains the area code, number, and 
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extension for a particular telephone. To communicate with a 
specific user over the telephone, an agent uses the phone 
gadget for that user. 

To use a gadget, the agent adds it to a session. In 
5 the case of a phone gadget, the VM dials the phone number 
and returns an active gadget to the agent's session. An 
active gadget represents the connection from an agent's 
session through a communications network to a communications 
device. The active gadget represents a transient connection 

10 to the persistent gadget. An active gadget can toe used for 
communicating with users, while a gadget itself cannot 
because it just stores the address and capabilities of the 
device. For example, a phone gadget specifies the number 
617-555-1212, while the active gadget represents an active 

15 phone line on which the number has already been dialed. A 
session consists of a collection of zero or more active 
gadgets connected together. 

To communicate with users, agents may need to add 
capabilities to active gadgets. A capability is a 

2 0 representation of an ability or feature of a device, such as 

the ability to recognize speech or play audio. For example, 
if an agent needs to fax materials to a user, it adds the 
fax capability to the active gadget. A discussion about how 
the VM handles capability requests from agents is presented 
25 later in the discussion of the VM. For other embodiments 
that include a graphical interface, a capability that 
outputs graphics and input capabilities that track the 
user's focus (i.e., mouse, eye, etc.) and gathers input from 
the user is envisioned. 

3 0 Agents can work with many different kinds of active 

gadgets simultaneously. For example, an agent can be 
simultaneously placing an outgoing call, sending a message 
to a pager, and sending a message to a fax machine. 
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User agents, tasks, and presenters: 
A user agent is an agent that is capable of 
communicating with humans as well as with gadgets. User 
agents use media-independent dialogs to communicate with 
5 users. These dialogs consist of a prompt from the user 
agent and a response from the user. When a user agent is 
communicating with a user, it directs its dialogs to an 
active gadget known as the focus gadget. For example, if a 
user agent is placing a call for the user with one active 

10 gadget and talking to the user with another active gadget, 
the latter gadget is the focus gadget. Other embodiments 
may group a set of gadgets to be used as the focus gadget. 
This would allow the assistant to interact with the gadget 
set and have the interaction span all gadgets in the set. 

15 User agents can also connect active gadgets together 

for phone conversations. When the user agent places an 
outgoing call for the user, and the called party answers the 
phone, the user agent connects the two active gadgets 
together and then goes in the background. When the user 

20 agent is in the background, it is idle; to delegate 

additional tasks to the user agent, the user needs to bring 
the user agent back into the foreground. 

A unigue feature of user agents is their ability to 
handle tasks. A task is an action to be carried out by a 

25 user agent on behalf of a user. A task may consist of a 

form full of fields that the agent gives to a presenter. A 
presenter is an object that knows how to engage in a dialog 
with a user over a given medium as part of filling in the 
fields of a task. An agent gives each task to a presenter. 

30 The presenter gathers information for a field and then hands 
the task back to the agent. The agent looks to see if a 
field changed recently and examines it. For example, the 
agent may request information from the database and adjust 
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the next field in the form. Then the agent hands the task 
back to the presenter, and it gathers information for the 
changed field. This activity of handing the task back and 
forth between the agent and the presenter continues until 
5 the agent is satisfied that all required fields are 
complete. Then, the agent executes the task. 

Figs. 6A-H illustrate the process of assigning a 
task and monitoring its progress. In this example, a user 
agent 150 has obtained a focus active gadget 152 for its 

10 session 154 and a user 156 has issued the "call" voice 

command (Fig. 6A) . In response, the agent assigns a call 
task 158 to a phone presenter 160 (Fig. 6B) . The presenter 
interacts with the user over the focus gadget to fill in the 
fields of the task (Fig. 6C) . The presenter gathers the 

15 name of the contact to call and places it in the first field 
before handing it back to the agent (Fig. 6D) . The agent 
notices that this field changed and looks up the contact in 
the object database 68 (Fig. 6E) . The agent adjusts the 
second field of the form to include the valid places defined 

20 for this contact and hands the task back to the presenter. 

The presenter gathers the location where to call the contact 
and places it in the second field before handing it back to 
the agent (Figs. 6E-F) . Once the fields of the task are 
filled, the user agent places the call (Fig. 6H) . 

25 A user agent obtains the list of tasks to carry out 

from an object known as the task stack. When the agent 
receives parcels (to be described shortly) , it may decide to 
place tasks on the task stack as a result of processing the 
contents of the parcel. The user agent gets the next task 

3 0 from the stack and carries it out. It repeats this process 
until there are no more tasks on the stack, and then it 
exits. In the process of carrying out a task, another task 
may be pushed on the stack. 
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A task is not the only way that an agent interacts 
with a user. For simple interactions, the agent uses a C++ 
method. Tasks are used primarily for complex interactions. 

Assistants and Electronic Assistants: 
5 An assistant is a user agent that is capable of 

making logical decisions and performing complex tasks on 
behalf of its users. The entity that will be referred to 
hereinafter as the electronic assistant is an assistant that 
assists users with their calls, messages, contacts, and 

10 schedule. Each subscriber has an electronic assistant. As 
an example of a complex task that distinguishes the 
Electronic Assistant from an ordinary user agent, the 
electronic assistant can locate a user by consulting the 
user's schedule and deciding which numbers to call. 

15 up to this point, three different classes of agents 

have been discussed: agents, user agents, and assistants. 
Table I summarizes the distinctions among them. 



Distinctions Between Agents and Assistants 



20 



Agent Type 


Description 


Agent 


A software entity that performs an action 
or brings about a certain result on behalf 
of a user or another agent. 


User agent 


An agent capable of communicating with 
humans using media- independent dialogs and 
carrying out tasks on a user's behalf. 


Assistant 


A user agent capable of making logical 
decisions and performing complex tasks. 



TABLE I 

Any agent can logically run several sessions 
25 simultaneously. Each time an agent handles a session, there 
is a separate process running for each session. The master 
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session is the session in which the electronic assistant is 
talking to its subscriber or trying to locate its 
subscriber. There can never be more than one master session 
running per subscriber. 

5 Box, Hand and Finger: 

Referring to Fig. 7 , the electronic assistant uses 
three different objects to manipulate its user's items, 
namely, a box 170, a hand 180, and a finger 182. The box is 
an object that contains all the items belonging to a user. 

10 The hand is an object that holds the items that the user is 
manipulating. The finger is a software pointer that marks 
the currently selected item in the hand. 

When the subscriber asks the electronic assistant to 
"Find" items, such as contacts, the electronic assistant 

15 looks in the box for the items and then picks them up in its 
hand. The hand can pick up a subset of the items in a box 
according to certain criteria. For example, when the 
subscriber asks the electronic assistant to find saved 
messages, it only finds the messages that are marked with a 

20 saved flag. 

When the user asks for the first item, the 
electronic assistant moves its finger to that item and 
selects it. If the user asks for the next item, the 
electronic assistant moves its finger to the next item in 

25 its hand. The items in the hand can be accessed in a 

circular fashion. If the finger points to the last item in 
the hand and the user asks for the next item, the finger 
moves around to the first item in the hand. 

Sequential vs. Random Access to Lists of Information 
30 Using first-item and next-item provides the ability 

to browse a set of items in a sequential fashion. So, if a. 
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subscriber instructed their assistant to Find New— Message, 
then using next item and previous item would give the 
ability to traverse through the new messages in forward or 
reverse order. First item would return the finger to point 
5 to the beginning of the list. 

In addition to this form of sequential navigation 
through items in a list, the system also provides 
capabilities to access information randomly or based on more 
complex queries. For instance, a subscriber can ask to see 
10 all the new message from a particular contact. The 
following dialog: 

Find <find what> 

New-Messages-From <new message from whom> 
Bill-Bishop 

15 would put into the hand all messages that had been received 
from the contact named Bill Bishop. This feature can also 
be extended to the group items which a subscriber can 
create. As a result, if a subscriber has a group called 
Hot-Prospects, the above dialog could be repeated as: 

20 Find <find what> 

New-Messages-From <new message from whom> 
Hot-Prospects 

and the result would be a list of items that includes new 
messages from members of the group Hot-Prospects. In 
25 addition to searching for items from a particular contact, 
this technique can be used to fetch items that have been 
stamped as priority or urgent, etc; Finally, all of the 
items described herein can be randomly searched for and 
collected into the hand based on different attributes. 

3 0 Context: 

The ability to Find items and gather them into the 
hand introduces into the system a concept of state. This 
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state includes the current contents of the hand and what is 
being pointed to. Since some number of commands may need to 
be issued to change the state, and since it is often 
desirable to go back to one of the previous state, the 
5 system supports an ability to "Go-Back" to the previous 
states. Each time a change in the state of the assistant 
occurs, the new state is stacked on top of the previous 
state. Issuing the Go-Back command pops the previous state 
of the stack. 

10 For example, let's assume the assistant is holding 

and pointing to a contact named Bill Bishop and the 
subscriber issues the following commands: 



In this example, the subscriber replaced the contents of the 
hand (which had been holding a contact for Bill Bishop) with 
a new message from Bill Bishop. After listening to the 

2 0 message (as a result of issuing the What ' s-It-Say command) 
the subscriber said Go-Back. The result of this command is 
that the previous state of the hand (the single contact for 
Bill Bishop) is restored. The subscriber can now manipulate 
this contact for Bill Bishop. 

25 Another useful tool for managing state is the 

ability to refer to the current item being pointed to in 
dialogs. The utterance "This-One"' is used to refer to an 
- object currently being pointed to by the assistant in the 
hand. The item is also referred to as "it" so that commands 

30 such as "Describe-It" , "Update-It", "Throw-It-Away" also 

refer to the current item. Finally, the commands "Send-A- 
Copy", "Send-A-Reply" and "Give-Them-A-Call" are operations 



Find 

New-Message 
What's-It-Say 



<find what> 

<one new message from Bill Bishop> 



15 



GoBack 
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on the current items to which the assistant's finger is 
pointing. These draw on the context and leverage on a 
shared understanding between the subscriber and the 
assistant of the context. The result is a set of condensed 
5 dialogs that do not need to explicitly refer to objects 

which can be inferred. Other embodiments can take advantage 
of pronouns such as "him", "her" and "them" to refer to 
objects that have recently been referenced. 

Parcels 

10 Agents can communicate with each other using 

parcels. A parcel contains the address of the sender 
(From) , the address of the recipient (To) , and contents (a 
persistent object) . The contents of a parcel can also be 
another parcel. 

15 The VM guarantees delivery of parcels. Any parcels 

addressed to a user are delivered to the master session for 
that user. If no master session exists, the VM starts one. 
Then, the electronic assistant tries to locate the user from 
within the master session. In other embodiments, parcel 

20 addresses can support distributed boxes and may support 
communication between remote agents . 

Agent to Agent Communication 

The fact that the VM acts as the routing and 
delivery mechanism for parcels is powerful. It allows all 

25 agent to agent communications to be mediated through a 

mechanism that guarantees the behavior that a recipient of a 
parcel will be either located or started to receive the 
parcel. In the described embodiment, the agent to agent 
communication exists as communication between two agents on 

3 0 a single box. In other embodiments which support 

distributed systems, the VM can use routing information 
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embedded in the (To) address to route the parcel to the 
appropriate remote or clustered system. An agent running on 
one coast could check to see if an agent for a user across 
the country wants to accept a call, and all the messaging 
5 could be happening across SS7 (signaling system-7 - a phone 
network) or some other wide area network without the need to 
do a call setup. 

MMUI: 

The MMUI is a media-independent interface for 

10 communicating with users. The MMUI allows agents to focus 
on the content to be communicated rather than the format of 
the content and the details of using specific devices in 
presenting content. 

The fundamental building block of the MMUI is an 

15 object referred to herein as a meme. A meme is a media- 
independent reference to a piece of information. The meme 
contains a set of media objects which store the information 
in a variety of different formats. A media object is a 
piece of information, such as a sound, text string, or DTMF 

2 0 sequence, that can be presented to a particular kind of 
communications device . 

Agents use memes and menus as part of their dialogs 
with users. These dialogs can be part of a task or outside 
of a task, such as from a C++ method. For example, as part 

25 of a call task, an electronic assistant needs to ask the 
user who to call. Within a field of the call task is a 
"Call Whom?" meme, which the presenter gives to the active 
gadget. The "Call Whom?" meme consists of several different 
media objects: for example, an audio recording for use on 

30 the telephone, or a text string for displaying on a computer 
monitor. The active gadget passes the meme to the VM, which 
selects the type of media based on the capabilities of the 
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active gadget. For example, if the active gadget is a 
telephone, the VM selects the audio recording in the meme, 
and the user hears something like "Call Whom?". 

The type of gadget is the primary factor but not the 
5 only factor that affects what type of media is used from a 
given meme. Users can set preferences for their electronic 
assistants, and system administrators and system integrators 
can set system-wide preferences that apply to all electronic 
assistants. For example, a user may select verbose prompts 

10 and a male voice for an electronic assistant. The system 
integrator may have set up the system with Spanish as the 
default language. These preferences are known as attributes 
in the MMUI. An attribute is a name/ value pair. Typically, 
an attribute is used to tailor the behavior of an active 

15 gadget and of all MMUI elements. 

When an electronic assistant obtains an active 
gadget from the VM, it sets the attributes on it. Later on 
in the session, when the electronic assistant sends memes to 
the gadget (in this example, as part of a task) , the 

20 attributes and the type of gadget determine what media 

object in a meme is used. For example, in a bilingual voice 
system, memes may have two audio media objects: one with the 
information recorded in French, and another with it recorded 
in English. If the user had set the language attribute to 

25 French and the gender attribute to male in the previous 

example, the resulting prompt would be spoken in French with 
a male voice. 

Another important building block in the MMUI is the 
midget. A midget is an object used for constructing a 
30 multimedia dialog. A midget in a multimedia user interface 
is analogous to a widget in a graphical user interface. The 
most common type of midget is a menu. Referring to Fig. 8, 
a menu 180 is a set of choices that can be presented to a 
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user using multiple media, A menu consists of rows 
describing each choice (a meme 182) and columns describing 
the media 184 that can be used to present the meme in each 
row. An additional column optionally stores a pointer 186 
5 to the referenced data object in the database e.g. a contact 
object. In some cases, columns are blank if it is not 
possible or appropriate to present the meme using that type 
of media. 

In a typical dialog, the electronic assistant passes 

10 a meme and a menu to the active gadget. The meme is 
presented to the user, the user's response is matched 
against the rows in the menu, and the menu row selected 
(known as the menu pick) and the data pointer (if present) 
are returned to the electronic assistant. For example, in 

15 the case of the call task, the electronic assistant passes a 
meme for presenting the question "Call Whom?" and a menu 
containing the user's contact list. In this example, the 
MMUI menu has four columns. The first column contains the 
text spelling of the contact name; the second column 

20 contains a string describing the DTMF sequence that can be 
used as a shortcut for the contact name; the third column 
contains the vocabulary used for recognizing the spoken 
contact name; and the fourth column contains a pointer to 
the contact's information in the database. 

25 Although this example menu has four columns, menus 

can have more columns to accommodate different attributes. 
For example, in a command menu, there could be verbose and 
terse versions of the spoken command or versions spoken in 
different languages. 

30 Some menus contain additional objects, called 

moguls, attached to them for managing complex media types. 
A mogul is an object that manages a particular type of 
media. A mogul can store media-specific information. For 
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example, an audio mogul is attached to the audio media 
column of a menu and stores the speech-recognition 
vocabulary to be downloaded on the recognizer hardware. A 
mogul can update the content of certain media in the memes 
5 in a menu when updates occur to other media in the memes . 
for example, a DTMF mogul updates DTMF media objects 
whenever their text media counterparts change. In Fig. 8, 
if the subscriber changes the spelling of Susan Schmidt to 
Susan Smith, the DTMF mogul would change "773" in column two 

10 of the menu to "776" . Media that are updated by a mogul 
instead of created by the user are called mogul-generated 
media. DTMF media is an example of mogul-generated media. 

The algorithm to generate DTMF for a name or command 
is as follows: DTMF commands are usually at least three 

15 digits. If there is one word, then map the first three 

letters to the corresponding telephone key that has those 
letters. If there are two words then map the first letter 
of the first word and use the two first letters of the 
second word. If there are three words then take the first 

20 letter of each word. In the case where three letters is not 
enough because the command is still not unique, then 
continue to take the first letter of subsequent words until 
you have a unique sequence. 

Fig. 9 shows the contact menu with the DTMF mogul 

25 188 and audio mogul 190 attached to it. Moguls form a sort 
of third dimension to menus; they can be thought of as the 
depth of a menu. 

Figs. 10A-D show the "Call Whom?" example previously 
discussed. First, the presenter 160 passes a meme 182 and a 

30 menu 194 to the active gadget 152 (Fig. 10A) . The menu has 
two moguls attached to it: a DTMF mogul 196 and an audio 
mogul 198. When the VM receives the menu, it unpacks the 
vocabulary stored in the audio mogul and downloads it on the 
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recognizer hardware. Since the active gadget is a telephone 
with speech recognition capabilities, the subscriber hears 
the electronic assistant say "Call Whom?" (Fig. 10B) . When 
the subscriber responds with "Susan Schmidt" (Fig. IOC), the 
5 recognizer hardware uses the vocabulary to find a match in 
the contact menu and returns the menu pick to the electronic 
assistant (Fig. 10D) . 

Hemes are often strung together in a meme list to 
form a complete statement. The agent or presenter can pass 
10 a single meme or a meme list to the active gadget. When a 
meme list is passed to the active gadget, the memes are 
played in the order in which they are listed. 

Utility agents: 

The parcel mechanism provides a way of communicating 

15 between agents and sessions. Utility agents carry out 
actions on behalf of the electronic assistant, such as 
delivering messages, scheduling reminders, and answering 
unassigned phone lines. There are several different types 
of utility agents including a postmaster agent, various 

20 courier agents, a cron agent and a secret agent. 

The postmaster agent is an agent t^iat receives 
messages from electronic assistants and distributes them to 
appropriate courier agents for delivery. The courier agents 
are agents that receive messages from the postmaster agent 

25 and deliver them to their destination. The message store is 
a portion of the object database containing the incoming and 
outgoing messages for all persons. 

Referring to Fig. 11, when Susan's electronic 
assistant 156 submits a message 200 for delivery, the parcel 

30 mechanism packages a reference to the message in a parcel 
202 and sends it to a postmaster agent 204. The parcel 
mechanism also inserts the message into the appropriate 
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user's section of the message store 206, In return, the 
postmaster agent 204 packages a chit 208 in a parcel 210 and 
sends it to the electronic assistant. A chit is a receipt 
issued by the postmaster agent, which an electronic 
5 assistant can use to check on the delivery status of the 
message. 

As the postmaster agent receives messages for 
delivery, it selects the appropriate courier agent 212 that 
should deliver it. Different gadgets require different 

10 types of courier agents. For example, the system courier 
agent delivers messages for electronic assistants, and a 
SkyTel* courier agent delivers messages to SkyTel pagers. 
When the postmaster agent gives a message to the system 
courier agent, it sends a "wake up" parcel 214 to John's 

15 electronic assistant 216 to notify it about the new message. 
The user's electronic assistant retrieves the message from 
the message store. 

If the courier agent could not deliver the message, 
it returns the parcel containing the reference to the 

20 message back to the postmaster. Depending on the number of 
retries permitted for the message, the postmaster may ask 
the courier agent to retry the delivery or mark the delivery 
as failed. 

Cron agent: 

25 The cron agent is an agent that receives reminders 

from agents, tracks them until they are scheduled for 
delivery, and then delivers them to agents. The cron agent 
may also track other items. Therefore, the parcel can 
actually contain any database object, not just a reminder. 

30 Referring to Fig. 12, when an electronic assistant 

150 submits a reminder for delivery, it creates a parcel 220 
containing a reminder 222, and the parcel is both from the 
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user and to the user. Next, it takes this parcel and places 
it in another parcel 224, which is addressed to the cron 
agent from the user. This latter parcel is submitted to a 
cron agent 226. When the cron agent receives this parcel, 
5 it unpacks the parcel inside it. It then places the parcel 
in a queue 228 in time-sorted order with other parcels and 
keeps track of when this reminder should be delivered. When 
the time comes, sends this inner parcel back to the 
electronic assistant. 

10 Object database 

The object datedzase is an object-oriented database, 
i.e., it is a database that maintains object structures and 
relationships directly rather than flattening them and 
reconstructing them. The object database stores all the 

15 information that both users and the system need across 

sessions. For example, if a user creates a new contact, the 
electronic assistant stores it in the object database. When 
the electronic assistant speaks to a user, it is using 
prerecorded media objects that are stored in the object 

20 database. 

The object database stores a user's information 
using three different objects: a box object, a message store 
object and a user object- The box object and the message 
store were previously mentioned. The user object describes 

25 a subscriber or person. 

Referring to Fig. 13, for each subscriber's 
assistant, the object database stores a box 230 containing: 
a pointer 23 2 to the user object 2 34 describing that 
subscriber; a contact list 236 which is a MMUI menu 

30 containing all of the user's contacts; and a pointer 23 8 to 
the section of the message store 206 containing the user's 
messages. The contact list is used with electronic 
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assistant commands, such as "Call" and "Send Voice Mail", 
which expect the user to specify a contact, 

A stored contact can also include a pointer to a 
note (e.g. a voice message) which the subscriber can 
5 generate and attach to the contact. When the subscriber 

instructs his electronic assistant to call the contact, the 
subscriber can also have the electronic assistant play back 
the attached note while the electronic assistant is 
attempting to establish the connection. The note might 
10 include information about the contact which the subscriber 
wishes to be reminded of whenever he calls that contact. 
For example, he may wish to know the name of contact's 
secretary so that he can address her by name if she answers 
the phone. 

15 Referring to Fig. 14, the database maintains a 

single definition for each user in the system, which is 
known as a user object 234. The user object specifies the 
user's password, gender, and schedule. It also contains a 
pointer 240 to the contact list 236 in the box 230, a list 

20 of places 242 (i.e., home, work, car, mobile, other, pager), 
a list of groups 244 and a list of any reminders 24 6 that 
the subscriber has generated. 

In the database, a person contact consists of a 
reference to a user object plus local information, such as 

25 the way the user pronounces the contact's name and a 

priority. Therefore, not only does the box point to a user 
object, but each person contact in the contact list in a box 
points to a user object. A group contact consists of a set 
of object IDs for user objects. A place contact consists of 

3 0 a phone gadget and fax gadget. 

Referring to Fig. 15, the object database also 
contains phone book objects 250. A phone book is a list of 
other subscribers on the system and consists of a set of 
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pointers 252 to user objects. There may be one or more 
phone books, but any subscriber can access only one phone 
book. Each entry in a phone book is a reference to one of 
the user objects.- Likewise, a user's contact 254 consists 
5 of a reference 256 to a user object plus some local 
information. 

When a user changes the portion of a contact that is 
stored in the user object, such as the work or home phone 
number, all phone books and contacts referencing that user 
10 object show the new information. 

For every user item, the object database also keeps 
general information necessary for accessing that item. This 
information includes the object ID, object label, owner, 
time of last modification, time of last access, and time of 
15 last change to ownership. The object label is a meme that 
can include the spoken and spelled names of the item. Each 
item also has a set of associated flags, such as read, 
unread, important, and not important. 

Virtual Machine 

20 The Virtual Machine is the system's operating 

system, it is a process that allocates and manages system 
resources for agents and assistants. The VM responds to 
requests from agent sessions (just as an operating system 
responds to system calls) and events generated by the 

25 hardware. In essence, the VM is a large event processor. 
The VM initiates all I/O operations, including input 
(recording memes) , output (playing memes) , and recognition 
(recognizing speech or DTMF) . In addition, the VM initiates 
connections to gadgets, disconnects connections to gadgets, 

3 0 and handles incoming connections. 
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Handling events: 

Referring to Fig. 16 , the VM 70 responds to three 
kinds of events: interprocess communication (IPC) messages 
260 from agent sessions 262; hardware events 264 from the 
5 communications cards and other hardware 266; and timer 
events 268 from its internal time queue. 

Each action taken by an agent results in an IPC 
message to the VM. The VM receives this "event" and carries 
out the request. The VM may send IPC messages back to the 
10 agent session to communicate with it. The VM transmits 
parcels using IPC messages. For example, when an agent 
sends a parcel to another agent, the first agent sends an 
IPC message to the VM, and as a result, the VM forwards the 
IPC message on to the second agent. 
15 Output to and input from the hardware causes a 

hardware event. For example, when a caller begins speaking 
a hardware event occurs. 

Objects internal to the VM can set timeouts as part 
of handling other events. These events are placed on a time 
20 queue, and when the time expires, a timer event occurs that 
the VM handles like any other event. For example, when a 
user is pressing DTMF tones, a timer goes off if there is a 
long pause between tones. 

The VM must handle events quickly and efficiently to 
25 provide fast response time to agent requests. The VM 

listens for events and when one occurs, it blocks all other 
activity and responds to it. When it finishes handling an 
event, it listens for the next event. 

After the VM creates a session, the session sends an 
3 0 IPC connection request to the VM. A VMListener object 310 
in the VM receives this connection request, and the VM 
creates a VMServer object 312 for that session to use in 
communicating with the VM. It is the VMServer object that 
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receives the IPC messages from the agent's session, as shown 
in Fig. 17- The VMServer object takes the object ID and 
performs an algorithm on it to obtain the appropriate RPC 
target (VMSession 314 or VMChannel 280) in the VM. The 
5 VMServer object then passes on the message to the right 
VMSession or VMChannel. To do so, the VMServer calls tJie 
RPC target's do_rpc virtual member function, and the RPC 
target processes the message and carries out the operation. 
Each agent session has a corresponding VMSession 

10 object 314 in the VM. The VMSession object stores the 
current state of the session. This object also queues 
parcels destined for the agent when there is no session 
running for the agent. Once a session is running again, it 
sends the parcels. 

15 Any VM object can create a Timer object to set a 

timeout. The VM stores Timer objects on its timer queue 
288, and when the timer goes off, a timer event occurs, 
which the VM handles like any other event. 

Managing resources 

20 The VM manages several kinds of resources: those 

that can be directly manipulated by agents, those that can 
be indirectly manipulated by agents, and those internal to 
the VM. Fig. 17 shows these resources in the VM where there 
are two active agent sessions and a conference between the 

25 sessions. 

Agents can directly manipulate sessions, channels, 
conferences, and parcels. For each agent session, the VM 
maintains a communications channel for IPC messages and an 
object that keeps track of the state of the session. 

3 0 Each active gadget in an agent session corresponds 

to a channel 280 in the VM. As previously noted, a channel 
can have a set of ports 282 attached to it, with each port 
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providing one or more capabilities. A capability is a 
representation of an ability or feature or a channel, such 
as the ability to recognize speech or play audio. Agents 
request capabilities from the VM in order to communicate 
5 with their users. In response, the channel attaches one or 
more ports, where each capability corresponds to one or more 
ports . 

A conference 284 is an object that can connect 
multiple channels in a single session or in multiple 

10 sessions. For example, to arrange a conference call, an 
electronic assistant places calls to each participant and 
then uses a conference object to connect the participants 7 
channels into a single session. 

For each parcel sent by an agent, the VM transmits 

15 it and if necessary, queues it. 

Through the manipulation of capabilities agents can 
indirectly manipulate ports. A port is an object referring 
to an input /output device. A channel creates one or more 
ports for each capability requested by the electronic 

20 assistant and manages the ports throughout the connection. 
The port connects to the hardware device 286 that actually 
has the capability. Ports can also be thought of as 
filters. For example, a speech recognition port is not an 
audio port; it takes audio and then converts it (i.e., 

25 filters it) to a menu selection. 

The VM also has internal resources including timers 
288, a bus 300, and user info objects 302. A timer is an 
object that represents a timeout. For example, a timer 
object is set for the time between DTMF keys and the time 

3 0 between retrieving a parcel and actually receiving it. A 

bus is an object that represents a data and control hardware 
bus, such as an MVIP bus or a TCP/IP bus. All objects 
connected to the bus can communicate with each other using 
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the same protocol. A user info object is a cached pointer 
to the User object in the object database. A user info 
object is created for each user that calls their assistants 
and logs in. 

5 The VM allocates and deallocates sessions as they 

are needed. When an incoming call arrives or a parcel is 
delivered to a user that does not already have a master 
session, the VM starts a new session. 

A resource manager in the VM performs resource 

10 management functions. For example, it Keeps a pool of free 
and available recognizers and interactive-in and 
interactive-out resources 3 04. It manages the assignment 
and deallocation of these resources and it notifies 
requesters when resources (e.g. channels) are not available. 

15 It also holds onto reservations for resources when they are 
not currently available and as soon as a requested resource 
becomes available, it assigns that resource to the 
requesting session. It also deallocates the interactive-in 
and interactive-out ports and recognition ports when they 

20 are no longer needed for a channel, freeing them up for use 
by other channels. 

Conference Objects: 

Conference objects are used by an agent and the VM 
to connect together multiple gadgets and their respective 

25 channels. The agent and the VM have slightly different 
views of a conference object. The agents see only the 
active gadgets that the agent is in control of; while the VM 
sees all the active gadgets and their respective channels 
that are connected into a conference. 

3 0 The methods on conference objects are listed and 

described in the following table. 
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Method 


Description 


Add 


This is used to add an active gadget or a 
gadget into the conference. When a gadget 
is added , an active gadget is returned. 


Remove 


This removes an active gadget from a 
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communications paths for the conference. 
The gadget remains part of the conference 
but it is not connected to the streams of 
communication with other participants. 
This allows an agent to carry on a dialog 
with a specific active gadget without other 
participants of the conference hearing. 


Switchln 


This is the complement of SwitchOut. It 
switches a switched-out active gadget back 
into the media streams of the conference 
from which the active gadget had previously 
been removed. 


CreateConf erence 


This creates a new conference. 


ChangeConf erence 


All conference operations work on a current 
active conference. ChangeConf erence 
changes the active conference. 


Des troyConf erence 


This destroys a conference and deallocates 
all resources associated with that 
conference. 



In the described embodiment, active gadgets are 
10 always connected to a conference object ♦ In the case that 
there is only one active gadget, such as when a subscriber 
or in caller are conversing with an electronic assistant, 
then the active gadget is connected to a special form of 
conference called the idle conference. 

15 Allocating Channels and Ports: 

To speed up the allocation of sessions, the VM keeps 
a session pool 3 08 containing sessions that are not yet 
assigned to users. As noted earlier, the electronic 
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assistant handles an incoming caller using a new session. 
The VM allocated this session from its session pool. The VM 
creates the session pool at system startup time and refills 
it as necessary. Each session in the pool is for a specific 
5 type of agent. 

The VM also maintains channel resources, which are 
pools of channels or ports that the VM can allocate to 
active gadgets. The channel resource or port resource is 
responsible for managing the allocations and deallocations 

10 from its pool. 

In the described embodiment, upon start up of the 
system, the VM constructs the channels that are necessary to 
handle communications through the interfaces supported by 
the system. The channels which are constructed have three 

15 possible modes: a listen mode, an idle mode, or an outgoing 
call mode. In the listen mode, the associated interface 
card is monitoring the incoming line (e.g. the Tl line) for 
incoming calls. In the outgoing mode, the interface card is 
set up to initiate a connection to its communications 

20 channel. Idle mode is used during transitions. Upon start 
up, all of the channels that are constructed are put into a 
listen mode, ready to receive an incoming call. 

As illustrated in Figs. 18A-B, when an agent 150 
adds a gadget 336 to a session, the VM gets a channel 280 

25 (Fig. 18A) , makes a connection on the channel, and creates 
an active gadget 152 (Fig. 18B) that refers to the channel. 
The channel represents the actual connection used to 
communicate with the user. 

Agents can add input, output, and recognition 

30 capabilities to gadgets, and the VM attaches ports with 

those capabilities to the channel. There are three types of 
ports: input ports, output ports, and recognition ports. An 
input port records audio into a meme. An output port plays 
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audio from a meme. A recognition port uses input from the 
user to make a selection from a menu. 

Each port represents a capability of a physical 
device , such as a line processing card or a voice 
5 recognition interface card. When the VM attaches ports for 
a given capability, it usually attaches more than one port. 
For example, in the described embodiment when an agent 
requests the recognition capability, the VM adds a voice 
recognition port for recognizing speech and a DTMF 
10 recognition port for recognizing DTMF tones. When the agent 
requests the output capability, the VM adds multiple output 
ports, with each port handling a different audio format. 
This allows transparent support for hardware which uses 
different audio formats. 

15 Presenting Memes and Menus: 

When the agent prompts the user with memes and 
menus, each port responds to the media or mogul that it 
knows how to play or recognize. For example, the output 
port plays any audio media for system prompts, while the 

2 0 recognize port performs speech recognition to determine the 

user's selection from a menu. For each channel, only one 
type of media is presented from each meme. When presenting 
memes, only one output port can present a meme at a time. 
When analyzing a response, however, multiple recognition 
25 ports can be listening at the same time; for example, one 
can be listening for speech and the other for DTMF tones. 
If DTMF tones occur, the DTMF recognition port cancels the 
speech recognition port that was listening for speech. 
Likewise, if speech occurs first, it cancels the DTMF 

3 0 recognition port. 

For example, consider the "Call Whom?" meme 
previously discussed. Referring to Fig. 19, when the 
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presenter 160 passes the "Call Whom" meme 192 and contact 
menu 194 to the active gadget 152 , the VM presents the meme 
to the collection of ports 282 attached to the channel 280. 
The audio output port knows how to play this meme, so it 
5 takes the meme and plays it. Next, the VM presents the menu 
to the collection of ports attached to the channel. Both 
the DTMF and VPC recognition ports know how to use this 
menu, so they take it and begin listening for DTMF tones or 
speech (respectively) . Once the VPC recognition port 

10 detects speech, it cancels the DTMF recognition port, 

matches the speech against the menu, and returns the menu 
pick to the presenter. 

When the VM presents a meme list to a collection of 
ports, each output port looks at the first meme in the list 

15 and any successive memes that have the same audio format. 
If an output port can play the memes , it removes them from 
the meme list, plays the memes in sequence, and returns the 
remainder of the list to the VM. The VM presents the 
remainder of the list to the ports again, and the process 

20 continues as before until there are no memes left on the 
meme list. 

Switching channels 

The VM is also responsible for connecting and 
switching calls. It keeps track of all connections between 

25 gadgets and it uses the conference object to connect and 
switch calls. A conference object can connect multiple 
channels into a single session. For example, to arrange a 
conference call, an electronic assistant places calls to 
each participant and then uses the conference object to 

30 connect the participants' channels into a single session. 
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Detailed Architecture an d Internal Design of Aoe^ts 

The system agents use a variety of application 
services to carry out its duties. Fig- 20 shows the 
functional relationships between the agents and these 
5 application services. 

Tasks 3 50 are part of an assistant layer 352 but are 
not used by the utility agents 3 54. The assistant uses the 
MMUI 356, parcels 358, database utilities 360, and 
miscellaneous utilities 362, which in turn depend on the VM 

10 364 and object database 3 66, Database utilities are a set 
of macros for accessing the database, while miscellaneous 
utilities are a set of standard data structures and 
containers (such as strings and bit arrays). The utility 
agents use parcels and database utilities and other VM 

15 services (e.g. access to channels, gadgets and active 

gadgets) carry out their duties. The parcel mechanism is 
layered on top of both the VM and the database utilities. 
Although the functional diagram shows the agents 
relationship with the rest of the architecture, it does not 

20 show the dynamics of a live session. 

Fig. 21 shows a sample assistant session. At the 
bottom of the session, the MMUI, VM and parcel libraries 
368, 370 and 372 are linked in. The active gadget 374, 
which the assistant uses to communicate with the user, is 

25 layered directly on top of the channel 376 itself. When the 
assistant takes a task 377 off the task list 378, the task 
and presenter 38 0 may pass menus 382 and memes 384 to the 
active channel or send parcels. In either case, the VM 
library sends an IPC message 386 to the VM, and the VM 

30 carries out the action. As part of carrying out the 

assistant's actions, the VM may send IPC messages back to 
the session. The VM library converts these messages back to 
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the objects familiar to the assistant. The task may return 
menu picks, memes, and parcels to the assistant. 

As noted earlier, conceptually, there are three 
kinds of agents, namely, agents, user agents, and 
5 assistants. In the class hierarchy, however, only agents 
and assistants have classes; user agents do not have a 
class* 

The Agent class 398 is the base class for all agents 
and assistants. Fig. 22 shows the Agent class and other 

10 closely related classes, including, Session, Acti veGadget , 
Parcel, and Task classes. Also, there is a PersistAgent 
class, which holds the persistent data for an agent. (In 
Fig. 22, solid lines represent subclasses and dashed lines 
represent friend classes.) 

15 Generally, the most important operations that are 

carried out by an agent are performed on the Session, Agent, 
or ActiveGadget, as outlined in Table 3. 



Class 


Operations Performed | 


Session 


Manipulating gadgets | 
Handling parcels 


Agent 


Handling tasks 

Handling contents of parcels 


ActiveGadget 


Presentation of prompts 
Collection of responses 
Manipulating capabilities 
Training 



TABLE 3 . 



25 



The following subsections describe SessionAgent , 
ActiveGadget, PersistAgent, Task, Parcel, and ParcelContents 
classes in more detail. 
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The Session Class: 

The Session class is a subclass of the Client class, 
which provides it with the capability of being an RPC client 
in the VM. The session provides the thread of execution for 
5 the agent and keeps track of the Interprocess Communication 
(IPC) connection to the vmserver process. 

The Session class consists of the following data 
members : 

• A socket, which is used for communicating with 
10 the vmserver process (inherited from Client 

class) 

• An object ID (inherited from Client class) 

• A pointer to itself (inherited from 
Client class) 

15 • A reference to the user object 

• A pointer to the focus gadget 

• A list of gadgets 

• A list of parcels 

The Session class contains member functions for: 

20 • Managing gadgets. The set includes functions 

for adding, removing, and connecting gadgets 
and getting and setting the focus gadget. 

• Handling parcels. The set includes functions 
for sending parcels, retrieving parcels, 

25 replying to parcels, and queue ing parcels. 

• Making a master session out of the current 
session. 

The Agent class: 

The Agent class is a subclass of both the Typed and 
30 ParcelHandler classes. The Typed class is the primary bass 
class. It provides run-time checking and member functions 
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for downcasting objects. The ParcelHandler class is a base 
dispatcher class for handling parcels and gives agents the 
ability to communicate with one another using parcels. 

The Agent class consists of the following data 
5 members : 

• A name (inherited from Typed) 

• A type ID (inherited from Typed) 

• A pointer to the session (inherited from 
ParcelHandler ) 

10 ♦ A timeout (inherited from ParcelHandler) 

• A task stack, which contains a linked list of 
task objects to be executed by the agent 

• A pointer to the active gadget 

• A pointer to the persistent agent 
15 The Agent class contains member functions for: 

• Handling tasks* This set includes functions 
for adding tasks, removing tasks, getting the 
next task from the task stack, performing a 
task, executing a task, and cancelling a task. 

20 • Returning the active gadget and persistent 

agent . 

• Handling each type of parcel contents 
(ParcelContents class) . These member functions 
are inherited from the ParcelHandler class. 

25 Each subclass of Agent contains any objects belonging to 
that type of agent. For example, the subclass for the 
assistant contains the hand and hand history, while the 
subclass for the courier agent contains the list of parcels 
to be delivered. In addition, each subclass of Agent 

3 0 contains methods for all actions carried out by the agent 
that are not tasks. 
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All agents follow the same basic skeleton of 
operations* Specifically, an agent carries out the 
following operations in its main processing loop: 

Creates a session 

Creates an agent session within the session 

Enters the perform method to perform tasks 

Continues performing tasks until there are no 
more tasks or an exception is received 

Deletes its session 

Exits 



The Agent :: per form ( ) method gets the next task on 
the task stack and performs it until there are no more tasks 
on the task stack. Utility agents, such as the cron agent 
and postmaster agent, however, do not use the task stack or 

15 tasks. Each utility agent knows how to handle a single type 
of parcel, so it provides its own perform () method that 
waits to receive a parcel and then services it. 

The main routine for a complex agent, such as the 
electronic assistant, has the basic operations just 

20 presented, but includes many additional operations, such as 
ones for initializing the memes in the database, looking up 
the slot of the vmserver process, looking up the subscriber 
to which it belongs, getting the focus gadget, and so on. 

The ActiveGadget class: 
25 _ The ActiveGadget class is derived from the DB class, 

although it is not actually stored in the database 
currently. The DB class is the primary subclass of Typed and 
it contains al of the objects stored in the database. The 
ActiveGadget class consists of the following data members: 
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« A pointer to the channel in the VM 

• A pointer to the meme list 

• A pointer to a menu 

The ActiveGadget class contains member functions for: 

5 • Presenting a meme and collecting a response 

from a menu. 

• Adding and removing capabilities. 

• Training utterances. The set includes 
functions for starting training, collecting 

10 utterances, adding training, completing 

training and aborting training. 

The ActiveGadget class defines operators for: 

• Presenting information to the user («) 

• Recording information from the user (») 

15 Gadgets store the "address" at which a user can be 

reached. There are the following types of gadgets: 

PhoneGadget, which describes the address of a 
telephone — the prefix, country code, city code, 
area code, number, and extension. 

20 AgentGadget, which describes the address of an 

assistant — a pointer to its subscriber's user 
object. 

ArdisGadget, which describes the address of an Ardis 
pager • 

25 SkyTel PagerGadget, which describes the address of a 

SkyTel pager — a pager ID and security ID. 

NumericPagerGadget , which describe the address of a 
numeric pager — a phone number. 

SocketGadget , which describes the address of a data 
30 network — a hostname, service, and service type. 
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Theoretically, any agent can use any type of gadget. 
Certain agents, such as the courier agents, are specialized 
to deal with particular kinds of gadgets. For example, the 
system courier agent knows how to deal with the AgentGadget, 
5 while the Skytel courier agent and Numeric Pager courier 
agent know how to deal with the SkyTel PagerGadget and 
NumericPager Gadget respectively. 

To use a gadget to communicate with a user, an agent 
adds it to the session to obtain an active gadget. Then, 
10 the agent can send and receive information over the active 
gadget . 

The PersistAgent class: 

The PersistAgent class has no data members or member 
functions. Each subclass of PersistAgent provides data 
15 members for its agent's persistent data and member functions 
for accessing the data. For example, the persistent 
assistant contains the subscriber's box, while the 
persistent cron agent contains a list of cron events. 

The Task class: 

2 0 The Task class is derived from the both the DB class 

and the FieldParent class. The FieldParent class gives a 
task a way of knowing whether it is changed, completed, or 
cancelled. 

The Task class consists of the following data 

2 5 members : 

• What has changed (inherited from FieldParent) 

• Whether the task is complete (inherited from 
FieldParent) 

• Whether the task has been cancelled (inherited 

3 0 from FieldParent) 
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• A pointer to the agent 

• A pointer to the active gadget 

• A pointer to the session 

The Task class contains member functions for: 

5 • Processing executing, and cancelling tasks* 

These functions are protected and accessed only 
by the Agent class. 

• Returning and setting the active gadget. 

• Returning and setting the agent. 

10 • Returning and setting the session. 

Each subclass of Task contains protected member functions 
for processing and executing that task. 

The Parcel class: 

The Parcel class is derived from the DB class and 
15 has the following data members: 

• The address of the recipient 
« The address of the sender 

• A parcel ID 

• A message ID 

20 ♦ A reference to the contents of the parcel 

The Parcel class has member functions for accessing data 
members of the class. It also has operators for: 

• Filling an RPC output buffer with the parcel 
(«) • 

25 # Removing a parcel from an RPC input buffer 

(») • 
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The ParcelContents class: 

The ParcelContents class is derived from the DB 
class, and is an abstract class. The ParcelContents class 
contains no data members and a single member function for 
5 processing the contents of a parcel. 

Each subclass of ParcelContents contains data 
members appropriate for the parcel type, such as a reminder 
or even another parcel. Each subclass also has a member 
function for processing its particular content. 

10 Setting up the focus gadget: 

The focus gadget is a specific active gadget that an 
assistant must use to communicate with a user. In the case 
of a phone gadget, the focus gadget has speech-recognition 
resources associated with it. 
15 When the VM starts a session for an incoming call, 

it equips the session with a focus gadget, Focus, which is a 
data member of Session. For phone calls, this focus gadget 
is of type PhoneGadget. The assistant uses this focus 
gadget to communicate with the subscriber, contact, or non- 
20 contact that is calling. Before the assistant can use the 
focus gadget supplied with the session, it must take the 
following actions to set it up: 

1. Obtain the focus gadget with the get_focus member 

function. 

25 2. Set any attributes needed on the focus gadget. 

3. Add any necessary capabilities to the focus gadget. 

4. Answer the call on the focus gadget. 

In the case where a session is started because of a parcel 
delivery, the VM does not create a focus gadget. In this 
30 situation, the assistant attempts to locate and notify the 
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subscriber by making an outgoing call. Before the assistant 
can communicate with the person being called, it must take 
the following actions to set up a focus gadget: 

1. Determine the gadget that corresponds to the 
5 subscriber's location. 

2. Obtain a focus gadget by adding the gadget to the 
session* 

3 # set any attributes needed on the focus gadget. 

4. Add any necessary capabilities to the focus gadget. 

10 Once the focus gadget is set up, the assistant can 
communicate with the user using memes and menus and 
manipulate the gadgets as necessary to make additional 
calls, place users on hold, and connect users. 

Obtaining additional active gadgets: 

15 Theoretically, agents can add any gadget to a 

session with the Session: add (Gadget) member function. When 
the agent adds the gadget, it receives an active gadget in 
return, and the focus is already set to that active gadget, 
making it the focus gadget* 

20 Each standard place (home, work, car, mobile, other) 

for each contact is represented by an addressable gadget 
object (e.g. a PhoneGadget object) that can be added to a 
session. For example, if a subscriber asks the assistant to 
place an outgoing phone call to a contact, the assistant 

25 obtains the phone gadget for the correct place (such as 
work) , and adds it to the session. 

When the agent adds a gadget to a session, the VM 
obtains a phone line and dials the number specified in the 
gadget. The VM does not create an active gadget and give it 

3 0 to the agent until the call connects successfully. The 

criteria for successful connection is variable; it can mean 
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the phone rang several times or that someone answered the 
phone. If the call fails to connect, the VM does not create 
an active gadget, and add operation fails for the agent. 

Setting attributes and capabilities: 
5 Before the agent communicates over the active 

gadget, it must set any attributes and capabilities that it 
needs for communicating with the user. 

Attributes specify which media get presented in a 
meme and can be thought of as properties of MMUI objects. 
10 The language and type of prompts (brief, dialog, 

instructional) used by the assistant are examples of 
attributes. Most attributes are set as part of the user 
preferences mechanism in the assistant. When the assistant 
starts up, it sets attributes on the focus gadget for each 
15 user preference. Whenever the user changes a preference, 
the assistant sets the appropriate attribute. 

The VM currently provides the following set of 
capabilities: 

Wf InterOutCap, which provides audio output 
20 • Wf InterlnCap, which provides^ audio output 

Wf Recognizer Cap, which provides speech and DTMF 
recognition 

WfFaxInCap, which provides fax input 

WfFaxOutCap, which provides fax output 

25 • WfAllCaps, which provides all capabilities 

The assistant can set capabilities when it sets up the focus 
gadget or anytime during the session with the user. For 
example, if the user asks to fax some information, the agent 
can add the fax capability to the active gadget for carrying 
30 out that user request. 
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Answering the focus gadget: 

For incoming calls, the assistant must answer the 
focus gadget with the ActiveGadget: : answer member function, 
which picks up the call and stops the ring tone. After the 
assistant answers the call, it can begin communicating with 
the user or callee (e.g. it could be a fax machine that 
called) . 

Connecting gadgets: 

Agents can connect, disconnect, and reconnect 
gadgets as needed during a session. To connect gadgets that 
are located in separate sessions, agents use a separate 
object, the Conference object. Once a connection is 
established, agents can disconnect and reconnect gadgets to 
place users on hold and connect them back in. 

Parcels: 

Agents use parcels to communicate with one another. 
A parcel consists of a target (addressee) , which can be a 
user or a session, and content (ParcelContents class) . 
Parcels are delivered to the addressee's session. If the 
addressee is an agent that can have more than one session 
running, the parcel is always delivered to the master 
session. 
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Table 5 describes the parcels that are used by 



agents . 

Parcel /Contents Types 



ParcelContents 


Description 


Sending or 
Receiving Agents | 


ConnectParcel 


Contains the caller 
requesting a connection and 
their name recording, a 
conference object, and the 
response from the callee 
and the caller. The 
assistant uses this parcel 
to negotiate connections 
between incoming callers 
and subscribers. 


Assistant 


CourierEvent 


Contains a bond to the 
message to send, the list 
of addressees to which to 
send the message, and the 
number of retries left. 
The Postmaster agent uses 
this parcel to submit 
messages to courier agents 
for delivery. 


Postmaster 
agent, SkyTel 1 
courier agent, 1 
system courier 1 
agent 8 


CronEvent 


Contains a parcel to be 
delivered and the time at 
which it should be 
delivered. The assistant 
uses this parcel to submit 
reminders (or any other 
items that need delivery at 
a specific time) to the 
Cron agent. 


Assistant, Cron 
agent 


MSFEvent 


Contains a bond to a 
message. The assistant 
uses this parcel- to submit 
messages to the Postmaster 
agent for delivery. 


Assistant, 
Postmaster agent 


NewMailEvent 


The courier agents uses 
this parcel to indicate to 
the assistant that they 
have just delivered a 
message. 


Assistant, 
system courier 
agent, SkyTel 
courier agent, 
Secret agent 
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ParcelContents 


Description 


sending or I 
Receiving Agents [ 


PostalEvent 


Contains a bond to the 
message to send, the list 
of addresses to which to 
send the message, and the 
number of retries left* 
Courier agents use this 
parcel to request that the 
Postmaster retry delivery 
of a message. 


Postmaster 
agent, SkyTel 
Courier agent 


ReminderEvent 


Contains a reminder. The 
assistant uses this parcel 
to package up a reminder. 


Assistant 



Assembling parcels 

To assemble a parcel, the agent uses the Parcel 
5 constructor and specifies the ParcelContents class as an 

argument. The agent can then fill in the appropriate fields 
in the parcel's contents. 

Sending and receiving parcels 

To send a parcel, reply to a parcel or retrieve a 
10 parcel, the Session class provides the following member 
functions: 

send_parcel 

reply_to_parcel 

retrieve_j?arcel 

15 When replying to a parcel, the VM automatically addresses 
the parcel to a specific session rather than a specific 
addressee. 



Passing the same parcel back and forth: 
Agents can pass a parcel back and forth, amending 
20 its contents each time, as a means of negotiation. As an 
example of using parcels in this manner, consider how the 



- 64 - 



WO 96/11542 



PCI7US95/11737 



assistant handles calls. When a subscriber calls a contact, 
there is no negotiation involved in connecting the telephone 
call. There is a three-part handshake involved in 
connecting a contact to a subscriber in the following two 
5 situations: 

An outside contact calls a subscriber; 

A contact that happens to be a subscriber calls 
another subscriber who is logged into the system 

Referring to Figs. 23A-D, the three-part handshake involves 
10 the following steps: 

1. When a contact calls a subscriber, the electronic 
assistant 4 00 for the contact's session 4 02 sends a 
connect parcel 404 to the master session 406 (Fig. 
23A) . The parcel indicates that the caller is 

15 requesting a connection to the subscriber and 

contains a reference to the contact (or a dummy 
contact if the caller is not a known contact) . If 
there is no master session, the VM starts one. 

2 . When the electronic assistant for the master session 
20 406 receives the connect parcel 404, it checks to 

see if it has the focus gadget 408, which indicates 
it already has a session running with the 
subscriber. If the electronic assistant does not 
have the focus gadget, it means that the session was 

25 started because it received a parcel, and it needs 

to try to locate and notify the subscriber in order 
to get the focus gadget. Once the electronic 
assistant has the focus gadget, it asks the 
subscriber whether or not to connect to the contact. 

30 if the subscriber says yes, the electronic assistant 

for the master session sends back the connect parcel 
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404 and indicates within it that the connection 
request has been accepted (Fig. 23B) . 
3 # When the electronic assistant for the contact's 

session receives the connect parcel, it sends it 
back with the indication to go ahead and make the 
connection (Fig. 23C) . When the electronic 
assistant for the master session receives the 
connect parcel, it connects the two gadgets and 
moves the contact's active gadget into the master 
session (Fig. 23D) . 



The way in which conference objects are used 
handling communications is illustrated in Figs. 25A-D, which 
diagram the steps in creating a conference object for 
establishing a call from one subscriber to another 

15 subscriber. In this example, it is assumed that both 
subscribers are initially interacting only with their 
electronic assistants and thus are connected to idle 
conference objects. This is illustrated in session A by a 
focus gadget 4 01a connected to an idle conference object 

20 403a and in session B by a focus gadget 4 01b connected to an 
idle conference object 403b. 

Subscriber A initiates the call by instructing his 
electronic assistant to establish a connection to subscriber 
B. In response, the VM checks whether subscriber B is in 

25 session, i.e., whether a master session is running for 
subscriber B. If subscriber B is in session, the VM 
notifies subscriber A's electronic assistant that subscriber 
B is in a session on the system. This information is 
communicated to subscriber A, who then is given the 

3 0 opportunity to confirms his request for a connection. When 
subscriber A confirms his request for a connection, his 
electronic assistant sends a conference request 4 05 to 
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subscriber B's electronic assistant and the VM creates a 
conference object 4 07 for the requested connection (Fig. 
24A) . 

If subscriber B indicates to his electronic 
5 assistant that he will accept the requested connection, his 
electronic assistant sends an acknowledgment (ACK) back to 
the electronic assistant for subscriber A (Fig. 24B) . In 
response to receiving the ACK from subscriber B, subscriber 
A's electronic assistant constructs a conference parcel 409 

10 identifying the conference object 407 which has been created 
for the connection, connects itself into the conference 
object 407, and sends the conference parcel 409 to 
subscriber B's electronic assistant (Fig. 24C) . 

When the electronic assistant for subscriber B 

15 receives the conference parcel from subscriber A, it 

extracts the identity of the conference object from the 
parcel, removes its active gadget from its idle conference 
object, and adds its active gadget to the newly received 
conference object 407 (Fig. 24D) . At this point, 

20 subscribers A and B are switched together and their 

electronic assistants let them know that they are connected 
by saying "Go ahead" . 

The Functionality of The Electr onic Assistant 

The set of commands that are accessible to the 
25 subscriber are presented in Exhibit A. The various tasks 
which the electronic assistant can execute implement the 
functionality associated with these commands. 

The following are more detailed descriptions of 
various functions that the electronic assistant performs for 
30 its subscriber using the mechanisms described above. The 
functions that are described include among others: (1) 
handling an incoming call to a subscriber, (2) creating a 
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contact, (3) creating a reminder, and (4) notifying the 
subscriber of a reminder. 

Handling an Incoming Call: 

First, we describe how the system handles an 
5 incoming call from a caller (i.e.. Bill Bishop) to a 

subscriber (i.e., Jim Smith). The system's call handling 
functions are invoked when a caller places a call to the 
subscriber's phone number. The central office notifies the 
system of an incoming call by placing a ring signal on a Tl 

10 line into the system. The interface card that is monitoring 
that Tl line responds to the ring signal by picking up the 
line and acquiring a 4 -digit sequence of touch tones 
identifying the extension that is being called. The 
interface card generates an event for the virtual machine. 

15 The virtual machine determines from the hardware what number 
was called and from that determines the identity of the 
subscriber being called. 

Upon receiving an indication of an incoming call, 
the VM starts up a session, i.e., a single thread of 

20 execution, that will be assigned to handling that call. 

When the VM server spawns the session, a handshake occurs 
between the session and the VM server to enable the two 
entities to communicate with each other. As part of the 
handshake, the server process passes to the session an ID of 

25 the subscriber for whom the session is being started up. 
Thus, the session is able to immediately establish a 
reference to a user object in the database that contains the 
stored information for the subscriber. 

In addition to receiving the ID of the user for 

3 0 which the session has been started, the VM also passes up 
the identity of the channel out of which it constructed an 
active gadget. The active gadget represents the 
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communication path to the incoming caller. The active 
gadget represents the particular communications line that 
the call came in over, in this case a phone line. Until the 
capabilities are added, the active gadget simply represents 
5 a channel with no input or output capabilities. Therefore, 
to enable the system to play prompts, record incoming 
signals, and perform voice and DTMF recognition on the 
channel, the assistant adds capabilities to the active 
gadget. For example, the assistant adds interactive-out 

10 capability, interactive-in capability and recognition 

capability. The capabilities that are added to the active 
gadget map to ports which the VM attaches to the channel. 
For example, interactive-out capability maps to an audio-out 
port for playing audio, the interactive-in capability maps 

15 to audio- in port for recording speech) , and the recognition 
capabilities map to speech recognition and DTMF ports. 

Once the capabilities are set up, the electronic 
assistant invokes an answer call task to handle the incoming 
call. Referring to Figs. 24A-B, the answer call task 

20 engages in a dialogue with the caller to collect certain 

information which will be necessary to beginning execution 
of the task. In general terms, the answer call task 
initially has two fields of information that must be filled 
in: WHO is calling and WHAT does the caller would like the 

25 electronic assistant to do. 

When an incoming call arrives for a given subscriber 
there are a number of ways in which the assistant might 
handle the call, depending on the preferences which the 
subscriber has previously selected. The assistant might 

3 0 directly forward the call to a telephone on the subscriber's 
desk phone, it might simply offer to take a message from the 
incoming caller, or it might attempt to locate the 
subscriber and offer to connect him to incoming call once he 

- 69 - 



SDOCID: <WO 961 1542A2_I_> 



WO 96/11542 



PCT/US95/11737 



is located. In handling the call, the answer call task 
first checks the subscriber's status to determine which 
preference he has selected (step 500). 

In the described embodiment, it is assumed that the 
5 caller wishes to have a connection established with the 

subscriber, thus the caller is not offered any options for 
responding to the second question. However, in alternative 
embodiments, the answer call task could be modified to offer 
the caller the option to leave a message, to specify the 

10 priority of the call, or to send a page, among other things. 
Also note that if ANI (automatic number 
identification) is supported, the interface card could have 
stripped off the ANI signal that it received from the 
central office and supplied it to the electronic assistant 

15 f Qr the session. If that number mapped to a contact in the 
subscriber's contact list, the electronic assistant could 
supply the "Who?" entry automatically without engaging in 
dialog with the caller. Even in that case, however, it may 
still be desirable to engage in a dialog with the caller to 

20 confirm the caller's identity. 

After the answer call task determines that the 
subscriber wants calls screened, the answer call task 
attempts to identify the caller (step 504) . The electronic 
assistant instruction passes an object (i.e., a mexne object) 

25 to the active gadget instructing it to say "Please say your 
name.* 1 It also passes the subscriber's contact menu to the 
gadget so that the caller's reply can be matched against the 
subscriber's contacts to determine whether or not the caller 
is a known contact- The electronic assistant asks the 

30 gadget to return the identified name. 

The gadget presents the "Please say your name" meme 
to the channel. The channel presents the meme to the 
collection of attached ports and the appropriate port plays 
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the meme by retrieving the stored audio version from the 
systems database. 

Upon receiving the caller's spoken response, the 
channel attempts to match the response against responses 
5 stored for known contacts in the contact menu. The result 
which is passed up to the electronic assistant will either 
be the . identity of a known contact or an indication that its 
an unknown contact. If it is an unknown contact, the answer 
call task invokes a take a message task which offers the 
10 caller the option to leave voice mail for the subscriber 
(step 506) . If the caller is a known contact, the 
electronic assistant determines whether the known contact 
is, in fact, the subscriber for whom the electronic 
assistant is acting (step 508). If the electronic assistant 
15 determines that the caller is the subscriber, the electronic 
assistant authenticates the subscriber by asking for a 
password before allowing him to have access to the system 
commands (step 510) . Once the identify of the subscriber is 
verified, the electronic assistant notifies the subscriber 
20 of all new messages that have been stored since he last read 
them and it starts the command task which gives the 
subscriber access to the set of commands for controlling the 
electronic assistant (step 512). 

If the caller is not the subscriber, the electronic 
25 assistant checks the subscriber's contact list to determine 
the priority of the caller (step 516) . Recall that the 
subscriber's contact list includes the subscriber's 
designation of each contact's priority (i.e., normal or 
high) . The priority indicates the importance of calls that 
3 0 are received from the caller. 

While checking the priority of the caller, the 
electronic assistant also checks the data structure for the 
contact to determine whether there are any voice messages 
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left for the caller by the subscriber (step 518). If there 
are voice messages, the electronic assistant invokes a 
deliver message task which asks the caller whether or not he 
wishes to hear the voice messages left by the subscriber 
5 (step 520) • 

The deliver message task performs the following 
sequence of events. First, the electronic assistant sends a 
meme to the channel instructing it to say to the caller M I 
have a message, would you like to receive it?". it also 
10 passes a pointer to the stored voice message in its message 
database. The caller responds by pressing the appropriate 
buttons on the touch-tone phone. If the caller presses "9" 
for "yes", the audio-out port retrieves and plays the 
appropriate media representation of the stored message. 
15 After the message is played, the electronic assistant checks 
whether there are any other unplayed messages, and if there 
are, it goes through the same sequence of operations to 
present the other messages to the caller. 

Note that the deliver-message task of the described 
20 embodiment does not perform any verification of the caller's 
identity. It may, however, be desirable to do so. This can 
easily be done by modifying the task to request a 
verification such as a password. 

After the deliver message task is complete or if 
25 there are no voice messages for the caller, the electronic 
assistant checks the availability of the subscriber (step 
522) . The subscriber has the ability to designate his 
availability through the "I Will Be" command (described 
elsewhere) . This information is stored in the database as 
3 0 part of the subscriber's user object. 

After identifying the subscriber's availability, the 
electronic assistant determines whether or not to attempt to 
establish a connection to the subscriber (step 524) . In the 
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described embodiment, the determination is based on whether 
the caller has high enough priority to meet the availability 
criteria established by the subscriber. For example, if the 
caller is identified in the database as a normal priority 
5 contact and the subscriber has indicated his availability to 
be "taking important calls", then the electronic assistant 
does not attempt to establish a connection to the subscriber 
for this particular call. On the other hand, if the 
contact's priority is designated as high priority, then the 

10 electronic assistant performs the operations necessary to 
establish a connection with the subscriber. 

The decision algorithm in the described embodiment 
is a very simple one. Alternative embodiments might use 
much more elaborate algorithms to filter the caller's 

15 request for a connection to the subscriber. For example, 

the decision could take into account other information about 
the caller such as his job description, or the identity of 
the company from which he is calling. Also, it would be a 
simple matter to let the priority of the caller change to 

2 0 reflect the number of calls that the subscriber has 

initiated to that caller within some preselected period of 
time. A contact that the subscriber calls frequently would 
have a priority that is higher than a contact to whom the 
subscriber rarely places a call. 

25 if the decision is made to not attempt a connection 

to the subscriber, the electronic assistant invokes a take 
message task (step 526). In general, the take message task 
reports to the caller that the subscriber is not available 
at that moment and asks the caller whether or not he wishes 

30 to leave voice mail for the subscriber. After the take- 
message task is complete, the electronic assistant performs 
whatever operations are necessary to disconnect from that 
channel and terminate the session for that caller. This 

- 73 - 



tSDOCID. <WO 961 1542A2_I_> 



WO 96/11542 



PCTAJS95/11737 



includes, for example, freeing up the resources that were 
required to support the connection from a caller by 
stripping off the capabilities that are associated with the 
active gadget and deallocating the audio-in, audio-out, and 
5 recognition ports from the channel so that they may be used 
by other sessions. 

If the subscriber has indicated that he is available 
to receive calls from that contact, the answer call task 
performs the steps that are necessary to establish the 
10 connection (step 528). To further illustrate the operation 
of parcel mechanism, the exchange of parcels that takes 
place to assist in establishing the connection will also be 
described* 

The answer call task in the caller's session sends a 

15 connect parcel addressed to the subscriber. The connect 
parcel requests that a connection be established with the 
subscriber and it identifies the caller's session. The VM 
handles the delivery of the parcel. If the VM determines 
that no master session currently exists, it spawns a new 

20 session which it designates as the master session and then 
delivers the connect parcel to that master session. 

Once the master session exists, it immediately 
checks whether any parcels have been delivered to it. At 
this point it will discover the connect parcel that was sent 

25 by the caller's session. In response to the connect parcel, 
the master session checks whether it has the focus gadget. 
If it does not, it invokes a locate-and-notify task the 
purpose of which is to attempt to establish a connection 
with the subscriber. 

3 0 Referring to Fig. 26, the locate-and-notify task 

first attempts to determine where the subscriber can be 
reached (step 600) . In the described embodiment, the 
assistant checks the subscriber's user object for stored 
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schedules. There are two schedule data structures which are 
used to list the schedule for a user: a default schedule and 
an override schedule. The default schedule indicates where 
the subscriber can typically be found throughout a normal 
day or throughout the days of a normal week. The override 
schedule that the subscriber can generate through the «i- 
Will-Be» or "Create-an-Itinerary» commands, overrides the 
default schedule for the relevant periods of time. For 
example, the subscriber may usually be in his office on 
Mondays and his default schedule will reflect this. 
However, on a particular Monday he may have to drive 
elsewhere to visit a client. He can generate an override 
schedule to reflect this change from normal routine. The 
override schedule would indicate that he is reachable 
through his mobile phone for the period of time that he is 
expected to be traveling by car and that he is thereafter 
reachable at the client's business location for another 
period of time. 

The subscriber could also set his location as being 
with a second subscriber whose schedule and whereabouts are 
known by the electronic assistant for the second subscriber. 
Once this is done, the schedule of the first subscriber will 
track the schedule for the second subscriber who the first 
subscriber is with. 

The locate-and-notify task checks the subscriber's 
override schedule to determine whether the subscriber has an 
override for that time. If no override exists, the 
electronic assistant checks the default schedule. if the 
default schedule identifies a location for that time, the 
electronic assistant can find the telephone number for that 
location from the information stored in the subscriber's 
user object. For example, if the default schedule indicates 
that the subscriber is scheduled to be at home at that 
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moment, the assistant looks in the user object to find the 
telephone number associated with the home location, assuming 
of course that the subscriber has provided one. 

Once the appropriate communications device and its 
5 number (or address) is identified, the electronic assistant 
adds a gadget representing that device to the session (step 
604). if the communications device is the subscriber's home 
phone, the gadget that is added to the session is a phone 
gadget that contains the phone number for the subscriber's 
10 home phone. 

In response to receiving a gadget from the master 
session, the VM allocates a channel to that gadget. since 
the gadget is a telephone, the VM causes the appropriate 
interface card to allocate a phone line channel connecting 
15 to a Tl line. The VM also passes the telephone number to 

the interface card and the interface card dials that number. 
If the called number is busy, the channel reports back to 
the master session that the attempt to connect failed. If 
the phone begins ringing, the channel returns an active 
focus gadget to the master session. The active gadget 
represents an actual connection to the subscriber's home 
phone . 

Upon receiving the active gadget, the electronic 
assistant immediately assigns capabilities to the active 
gadget to enable it to communicate through the gadget and 
waits for somebody to pick up the phone (step 606) . if a 
party answers, the electronic assistant determines whether 
the user is available (step 608). it does this through a 
dialog during which the answering party can indicate whether 
30 the subscriber is there to receive the call. in the 

described embodiment, the electronic assistant causes the 
audio-out port to play the previously described stored 
message : 
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Hello, I'm trying to reach Jim Smith. if he is 
available, press the 9 key. if he is not available 
press the 6 key or hang up. ' 

It might be desirable to tailor the dialog to take 
5 into account the location of the phone that was just called. 
For example, one might wish to designate phones as being in 
friendly territory, questionable territory, or hostile 
territory. A phone that is in friendly territory might be 
any phone for which it is likely that the person answering 
10 it is familiar with interacting with the electronic 

assistant. A phone that in questionable territory might be 
is any phone for which there is a 50/50 chance that the 
answering party is not experienced in interacting with the 
electronic assistant. Phones in hostile territory might be 
15 any phone for which there is a reasonably high likelihood 
that the answering party is not experienced in interacting 
with the electronic assistant. By identifying the 
connection devices in this manner, an appropriate dialog for 
interacting with the answering party can be selected. For 
the inexperienced user, the electronic assistant might use a 
verbose dialog which explains how to interact with it more 
fully. Whereas, for the experienced user, it might use a 
terse dialog which assumes that the person knows how to 
respond without being told. 
25 If the answering party indicates that the subscriber 

is not available, this is reported to the master session 
which then sends a reply parcel back to the caller's session 
indicating that the connection attempt failed (step 610) . 
The master session then goes through the sequence of steps 
that is necessary to terminate the session. This includes 
stripping the capabilities from the active gadget. 
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If the answering party indicates that the subscriber 
is available, then the electronic assistant announces the 
identity of the caller to the subscriber (step 612) : 

There is a call from Bill Bishop. Do you want to take 
5 the call? Indicate Yes by pressing the 9 key, indicate 

No by pressing the 6 key. 

It then asks the subscriber whether he wishes to accept the 
call (step 614) . 

If the subscriber declines the call, the locate-and- 

10 notify task sends a failed connection reply parcel back to 
the caller's session (step 616) . In response to the fail 
reply, the caller's session informs the caller that the 
subscriber could not be found and it offers the caller the 
option of leaving a recorded voice message for the 

15 subscriber. If the caller accepts the message option, the 
electronic assistant records the caller's voice message and 
stores it for the subscriber to play back at some later 
time. 

While the connection to the subscriber still exists, 
20 the locate-and-notif y task asks the subscriber whether there 
is anything it can do for the subscriber at this time (step 
618). If the subscriber responds by saying no or pressing 
the M 6 W key, the locate-and-notif y task says "Goodbye" to 
the subscriber, sends a hang-up command to the channel, and 
25 then terminates. On the other hand, if the subscriber 

indicates that he wants to access his electronic assistant's 
commands, he must first do additional authentication, then 
the electronic assistant starts a command task which enables 
the subscriber to access those capabilities either through 
30 voice or DTMF commands (step 620) . The commands identified 
in Exhibit A are then available to the subscriber. 

If the subscriber accepts the call, the locate-and- 
notify task sends a connect reply parcel to the caller's 
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session. Referring to Fig- 24B, in response to receiving 
the connect reply parcel, the caller's session removes from 
its active gadget some of the capabilities that will no 
longer be necessary (e.g. interact ive-in, interactive-out, 
5 and recognition capabilities) (step 536) and sends the 

gadget to the subscriber using the parcel mechanism (step 
540) . Upon receiving the active gadget from the caller's 
session, the electronic assistant for the master session 
adds that active gadget to its session and connects both the 

10 active gadget and the focus gadget together thereby enabling 
the subscriber and the caller to communicate with each other 
(step 622 in Fig. 26) . The interface cards perform the 
switching function that connects the two gadgets together by 
allowing each gadget to listen to the output of the other 

15 channel (i.e., by allowing each channel to have access to 

the appropriate time slot on the MVIP bus line that carries 
the other party's signal) . 

When the connection is established between the 
caller and the subscriber, the electronic assistant switches 

20 into a background mode in which it monitors the subscriber's 
output for a command (i.e., "Wildfire") that will call it 
back into its foreground mode (step 624) . While the 
electronic assistant is in the background, it only responds 
to the single command which calls it back into the 

2 5 foreground and it ignores all of the other commands in its 
command set. 

Recall that it was assumed in the above discussion 
that a master session was not running when the VM wanted to 
deliver the request connect parcel from the caller's 
30 session. If, however, there is a master session running, 
the VM delivers the parcel to that master session. 

When an already existing master session receives a 
request to connect parcel from another caller's session, it 
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is stored in a queue. The master session repeatedly checks 
this queue for received parcels. When it detects the 
presence of a new parcel, it acts upon it immediately. in 
the case of the request to connect parcel, the master 
5 session initiates a notify task (see Fig. 27) . The notify 
task checks whether the subscriber is interacting with the 
master session through a phone channel (step 630) . if the 
subscriber is interacting with the session through 
communication channel other than a phone channel, it sends a 

10 notification to the subscriber which identifies the caller 

and notifies the subscriber that he has just received a call 
from that individual (step 632). The master session then 
sends a reply to the request to connect parcel back to the 
caller session indicating that the attempt to connect failed 

15 (step 634) . 

If the caller is communicating with the master 
session through a phone channel, the notify task first 
determines whether the subscriber has indicated that he is 
willing to accept an interruption to his call (step 636). 

20 For example, when the subscriber connected to the call he 
could have used a "Hold All Calls" command to indicate his 
preference to not be interrupted. The "Hold All Calls" 
command temporarily sets the subscriber's status as 
unavailable during a call being handled by the master 

25 session. 

If the subscriber indicated . that he did not want to 
be interrupted, the notify task sends a non-interactive 
notification to the subscriber of the caller's attempt to 
reach him and then reports the caller's session that the 
30 connect attempt failed (step 634). 

If the subscriber did not set his status to block 
interruptions, the notify task notifies the subscriber of 
the caller on hold (step 638). The system has a form of 
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call waiting in which it first plays a short tone that can 
be heard by both the subscriber and the party he is talking 
to and then it plays the name of the incaller, using the 
caller's spoken self identification. The interruption is 
handled so as to prevent the caller with whom the subscriber 
might be talking at that moment from hearing the message 
(referred to generally as the smart call waiting feature) . 
On some phones this is done by placing the caller on hold 
for a short period of time while the announcement is made. 
On other phones, it is possible to send the message to the 
earpiece on the subscriber's phone without the caller 
hearing it. 

The electronic assistant then gives the subscriber 
the option to reject the call or to accept the call. 
15 Current valid commands are Take-A-Message and I'll-Take-It. 
If the subscriber rejects the call or does not respond 
within a preselected period of time, the electronic 
assistant sends a fail reply to the caller's session (step 
642). In response to the fail reply, the caller's session 
may simply inform the caller that the subscriber could not 
be found or it may offer to take a message from the caller. 
If the caller accepts the message option, the electronic 
assistant records the caller's voice message and store it 
for the subscriber to play back at some later time. 

If the subscriber accepts the new call, the 
electronic assistant merges the caller's gadget into the 
master session (step 644) and places the first caller on 
hold. 

While the subscriber is connected to the new caller, 
if yet another call comes, the electronic assistant handles 
the new call in the same manner. Thus, it is possible for 
more than one caller to be on hold at the same time. 
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To switch back to the first caller, the subscriber 
issues a "Press-the-hold-button" command. if there is only 
one call waiting, the electronic assistant puts the second 
caller on hold and switches to the first caller. Thus, by 
5 using the "Press-the-hold-button" the subscriber can toggle 
back and forth between the two parties. However, if there 
is more than one call waiting, the electronic assistant 
responds to the "Press-the-hold-button" by placing the 
caller with whom the subscriber is presently speaking on 
10 hold and saying: "Shall I connect you with <caller's name>?« 
If the subscriber responds by saying "yes", the electronic 
assistant connects the subscriber to the identified caller. 
If the subscriber responds with "No", the electronic 
assistant again says "Shall I connect you with <next 
15 caller's name>?" In this manner, the electronic assistant 
proceeds cycles the group of parties that are on hold until 
the subscriber indicates that he wishes to establish a 
connection with one of the parties. 

Create Contact: 

20 Among the other commands available to the subscriber 

are certain non-message object creation commands. These are 
used to create contacts which are added to the subscribers 
contact list, and to create call and recorded reminders, 
both of which are held by the cron agent for delivery to the 

25 subscriber at the appropriate time. Through a phone 
conversation the subscriber invokes these commands by 
_ saying: " Cr eat e-A-Con tact " , Remind-Me-To-Call" and "Remind- 
Me" . An example dialogue which results from invoking the 
"Create-A-Contact" command is shown in Fig. 28. The 

30 mechanisms for implementing this dialog through the channel 
(and other dialogues which are described below) are the same 
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as were described earlier and will not be repeated in the 
following discussions. 

In response to receiving a "Create-A-Contact" 
command, the electronic assistant asks the subscriber "What 
5 kind - person, place or group?". The subscriber responds by 
identifying one of these types. In the illustrated dialogue 
of Fig. 28, the subscriber responds by saying "Person". The 
electronic assistant then asks for the person's name. Since 
the electronic assistant will use the subscriber's 

10 vocalization of the contact's name, it asks the subscriber 
to repeat the contact's name a second time to improve the 
quality of the stored vocalization. 

After the subscriber has identified the contact's 
name, the electronic assistant requires the subscriber to 

15 add a phone number by asking: "Which phone number?". In 

response, the subscriber can identify home, work, mobile or 
other. In the illustrated example, the subscriber responds 
by saying "Work". The electronic assistant then asks for 
the phone number and the subscriber enters the number 

2 0 followed by a # key. The electronic assistant acknowledges 

receipt of the number by saying "Got it. Now what?" 

If the subscriber wants to enter additional 
information for that contact, he uses the "Update-It" 
command. The electronic assistant responds by asking 
25 "What?". To this the subscriber responds by identifying the 
particular item of information which he wishes to add or 
update. If the subscriber says "Priority", the electronic 
assistant asks "Normal or high". After the subscriber 
selects one of the available priorities, the electronic 

3 0 assistant indicates it is ready for another command by 

saying "Got it". If the subscriber wishes to add or modify 
further information for that contact, he again uses the 
"Update-Tt" command followed by an indication of what 
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information is to be added or changed. In the illustrated 
example, the next item selected for update by the subscriber 
is the spelling of the contact's name. In response, the 
electronic assistant notifies subscriber that it is ready to 
accept the new information by saying "Begin spelling now". 

After the spelling has been entered, the electronic 
assistant indicates its readiness for the next command from 
the subscriber. If no command is forthcoming within a 
predetermined period of time, the electronic assistant 
prompts the subscriber for a command. If the subscriber 
indicates that he has no further additions or changes at 
this time, the electronic assistant indicates that the 
11 Cr ea t e-A-Contact w is over by saying the "Done". 

The subscriber can also create new contacts by 
15 copying them from the phone book. Phone book entries, 

however, do not have voice identifications associated with 
them. Thus, if the subscriber copies an entry from the 
phone book, he must add a voice identification to make it 
part of his contact list. 
20 Finally, a contact can be created as a result of 

receiving a message with an enclosed contact. When 
reviewing a message which contains a contact, the assistant 
asks the subscriber if he wishes to transfer the contact out 
of the message and into the subscriber's contact list. If 
25 the subscriber responds by saying "yes", then an procedure 

that is similar to the one for saving phone book listings is 
executed . 

It should be noted that the system is capable of 
capturing phone numbers under other circumstances and later 
use this captured information to assist the subscriber. For 
example, when the assistant responds to a call and offers 
the caller the option of leaving a message for the 
subscriber, it first asks the caller to enter his phone 
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number. After the phone number has been entered, the 
electronic assistant then records the caller's voice 
message. Also when the subscriber dials another number 
through the electronic assistant, that dialed phone number 
5 is remembered. This remembered information can later be 
used in response to a "Give-Them-A-Call" command or to 
create a contact for the person or place that is represented 
by the remembered phone number. 

Remind -me 

10 As indicated above, the subscriber can use the 

commands "Remind-Me" and "Remind-Me-To-Call" to create 
reminders. The dialogs associated with these tasks are 
shown in Figs. 29 and 30, respectively. 

To create a reminder, the subscriber issues the 

15 "Remind-Me" command. This causes the assistant to invoke 

the reminder task with an attribute to indicate the creation 
of a recorded reminder. At this point the assistant prompts 
for what the user would like to be reminded about. At that 
point it starts recording as a message whatever the 

20 subscriber says into the phone, to be played when the 

reminder comes due. The subscriber indicates the end of the 
message by pressing the key and the electronic assistant 

stores the recorded message in its database. 

After the recording is complete, the electronic 

25 assistant engages in further dialogue with the subscriber to 
establish a time at which the reminder is to be delivered. 
The electronic assistant asks "When?". The subscriber 
responds using one of several conventions that are available 
in the system for specifying a time. The conventions are 

3 0 shown in Table III. In the described example, the presenter 
responds by saying "Today". The electronic assistant then 
asks the subscriber for a time. Using touch tone signals 
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DTMF, the subscriber enters a specific time (e.g. 530 for 
5:30). The electronic assistant responds by restating the 
time which the subscriber has entered and asks for a 
confirmation from the subscriber. If the subscriber 
5 confirms the time, the electronic assistant indicates that 
the dialogue has concluded successfully by saying "Got it 
I'll set my watch alarm.". If the subscriber does not 
confirm the time (either because the electronic assistant 
made a recognition error or because he changed his mind) , 
10 the electronic assistant then repeats the above described 
sequence of operations to obtain a new time. 

If the subscriber issues the "Remind-me-to-call" 
command (see Fig. 30) , the create-reminder task asks the 
subscriber "Call whom?". The subscriber's response "John 
15 Smith" is recognized by the voice recognition capabilities 
of the ASR card and matched against utterances stored with 
the subscriber's contact list that identify the subscriber's 
contacts. If a match is found (i.e., if a contact from the 
subscriber's contact list is identified), a pointer to that 
20 contact is saved as part of this call reminder. 

The electronic assistant then engages in the 
previously described dialogue with the subscriber to obtain 
a delivery time for the reminder. After the delivery time 
has been entered, the electronic assistant asks the 
25 subscriber: "Should I know the topic for the call?". if the 
subscriber responds in the affirmative, the electronic 
assistant says "Recording-.. " to notify the subscriber that 
he may begin recording. As before, the subscriber ends the 
recording by pressing the # key. in response, the 
3 0 electronic assistant confirms that the command has been 
received by saying "Got it, I'll set my watch alarm". 

The electronic assistant sends the completed 
reminder to the Cron agent via the parcel mechanism. First, 
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it packages the reminder in a parcel that is addressed to 
the subscriber and includes its desired delivery time. it 
then places that parcel in a second parcel that is addressed 
to the Cron agent. The VM delivers the outer parcel to the 
5 Cron agent. Upon receipt, the Cron agent opens the parcel 
and pulls out the parcel that is inside; it checks the 
delivery time for that parcel and places it in a time 
ordered queue. The Cron agent keeps track of the delivery 
time of the top parcel. 

10 Handle Reminder Task: 

At the delivery time, the Cron agent wakes up and 
sends the reminder parcel to the indicated address. As 
described earlier, the VM handles the delivery of the 
reminder parcel. If there is no master session running, the 
15 VM starts up a session and delivers the parcel to that new 
session. If there is a master session running, the VM 
delivers the parcel to the master session. 

Referring to Fig. 31, when the session that receives 
the parcel detects that it has received a reminder parcel, 
20 it starts up a handle reminder task. The handle reminder 

task checks whether the current session is a master session 
(i.e. whether the session has the focus gadget) (step 700). 
If the session is communicating with the subscriber, the 
electronic assistant notifies the subscriber of the reminder 
25 using the existing focus gadget (step 702) . The method used 
for notifying the subscriber depends upon the channel 
through which the subscriber is connected to the session. 
If the subscriber is connected through a phone channel, the 
electronic assistant briefly interrupts his call, reports to 
3 0 him that a reminder has come due. After notifying the 

subscriber of the reminder, the electronic assistant places 
the reminder in an active reminder list in the box, where 
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the subscriber can review it during the current or any 
subsequent sessions (step 703). The user can then invoke a 
w Find-Active-Reminders" command to deal with the recently 
received reminder, 
5 If the subscriber is not connected to the session, 

the handle reminder task initiates a locate-and-notify task. 
The locate-and-notify task operates in the manner previously 
described except that instead of notifying the subscriber of 
a caller's attempt to reach him, it notifies the subscriber 
10 of the reminder. If the locate-and-notify task is unable to 
establish a connection with the subscriber, it sends a 
f ailed-to-connect reply to the handle reminder task. In 
response, the handle reminder task checks whether the 
subscriber can be reached through a non- interactive means 
15 by, for example, checking for a pager assigned to the 
subscriber. If no alternative means for notifying the 
subscriber exists, the electronic assistant places the 
reminder on the non-scheduled reminder stack (step 710) . If 
an alternative means for notifying the subscriber does 
20 exist, the electronic assistant sends a notification to the 
alternative gadget (step 708) before placing it on the non- 
scheduled reminder stack. 

If the locate-and-notify task establishes a 
connection with the subscriber, the reminder is played back 
25 to the subscriber (step 720) . After that, the electronic 

assistant asks the subscriber if he wants to reschedule the 
reminder (step 722) . If he elects to reschedule it, a 
rescheduling task is executed that enables the subscriber to 
change the delivery time for the reminder (step 724) . The 
3 0 modified reminder is then sent back to the cron agent for 

delivery at the rescheduled time. The handle reminder task 
then starts the command task loop running which enables the 
subscriber to access the full set of commands (step 726) . 
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Additionally, at any time during a session, a subscriber can 
"Find" and reschedule pending reminders using the "Update- 
it" task. (See Exhibit A). 

If the subscriber elects not to reschedule the 
5 reminder, the reminder is placed on a non-scheduled reminder 
stack (step 725) and the handle reminder task starts up the 
command task loop (step 724). If the call reminder 
identifies a contact, the subscriber can use the "Call Them" 
command at this point. Since the call reminder includes a 

10 pointer to the contact in the subscriber's contact list, the 
electronic assistant interprets the "Give-Them-A-Call" 
command as though it was equivalent to a "Call" command 
where the contact is the one in hand. The electronic 
assistant uses the gadget identified in the contact's object 

15 to place the call. 

If the locate-and-notify task reaches the 
subscriber's gadget but the connection to the subscriber is 
not accepted, the electronic assistant places the reminder 
in the active reminder stack (step 703). 

20 Other tasks that are implemented by the system are 

best described through examples of the commands and the 
dialogs that the system supports when interacting with the 
subscriber. A summary of the commands is presented in 
Exhibit A at the end of this specification. 

2 5 Find Command: 

Referring to Fig. 32, the subscriber can retrieve 
and review access various items of information that are 
stored for him by using the Find command. When the 
subscriber says "Find" while the electronic assistant is 

3 0 running the command task loop, the electronic assistant 

replies by asking "Find what?". At this point the 

- 89 - 



S DOC ID: <WO 9611542A2 I > 



WO 96/11542 PCT/US95/11737 



subscriber can identify onei of seven different items, 
namely , contact , all-the-contacts , phonebook-listings , 
messages , new messages, messages from, saved messages, 
reminders, and trash* The operation of the Find command 
5 will be illustrated using the contacts as the item which the 
subscriber wishes to manipulate. 

When the subscriber is asked for what it is that he 
wants to find, he replies by stating "All-The-Contacts" . 
The electronic assistant then accesses the subscriber's 

10 contact list and reports to the subscriber how many contacts 
are on the list. Then the electronic assistant waits for a 
command from the subscriber. The subscriber can search the 
contact list for a given name by issuing the Find "Contact" 
command. In response, the electronic assistant asks 

15 "Contact name?". The subscriber responds with a name, e.g. 
John Smith. The electronic assistant acknowledges by 
replying "Contact name, John Smith" to indicate it has the 
user object for that contact in its hand. The subscriber 
can then instruct the electronic assistant to: (1) describe 

2 0 it by issuing the describe command; (2) update it by issuing 

update command; (3) discard it by issuing the "Throw-lt- 
Away" command; or (4) call them using the "Give-Them-A-Call" 
command. 

If the subscriber issues the describe command, the 
25 electronic assistant replies by reporting to the subscriber 
the information that is stored for that contact. 

If the subscriber issues the update command, the 
electronic assistant initiates a dialog similar to that 
previously described with the Create command which enables 

3 0 the subscriber to modify or add to the information stored 

for that contact. 

The subscriber can instruct the electronic assistant 
to dial a number by saying "Call" which causes the 
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electronic assistant to invoke the call task. The call task 
asks the subscriber to provide the number, which may be 
provided either as a reference to a contact, by entering a 
specific number or by saying "this-one" , referring to a 
5 contact message or call reminder being pointed to; then, it 
dials the number- As soon as the ringing begins, the 
electronic assistant establishes a connection is between the 
subscriber's line and the outgoing call line so that the 
subscriber can complete the call. When that connection is 

10 made, the electronic assistant automatically switches into 
its background mode in which it will only respond to a 
particular command (e.g. "Wildfire") which causes it to 
switch back into the foreground. In its background mode, 
the electronic assistant disables its full command set so 

15 that the utterance of the command words during the course of 
the conversation will not unintentionally invoke a command 
task. 

Anytime the electronic assistant is running its 
command task which give the subscriber full access to its 
20 command set, the subscriber can instruct the electronic 

assistant to go into its background mode. He does this by 
saying the "That will be all for now". in response the 
electronic assistant replies, "Say Wildfire when you need 
me" and switches into its background mode. When the 
25 subscriber needs to access the full command set of the 

electronic assistant, he says "Wildfire". in response the 
electronic assistant moves back into its foreground mode and 
replies "Here I am", confirming that the electronic 
assistant is again fully active and that the subscriber may 
now access the full set of commands that are supported by 
the electronic assistant. 



30 
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Generating an Override Schedule: 

As indicated above, the "I-Will-Be" and "Create-An- 
Itinerary" commands enable the subscriber to generate an 
override schedule* A typical dialog for each of these 
5 commands is shown in Figs. 33 and 3 4 , respectively. The 

dialog for each of these commands is very similar. The "I- 
Will-Be" command is used for generating a single 
modification to the subscriber's schedule; the "Create-An- 
Itinerary" comand is used for generating more complex 
10 override schedules. 

When the subsrciber utters "I-Will-Be", the 
electronic assistant responds by asking "Doing What?" In 
the described embodiment, the subscriber may respond in one 
of the four following ways: 

15 Taking Calls 

Only Taking Important Calls 

Unavailable 

Running on Schedule 

"Taking Calls" indicates to the electronic assistant that 
20 all calls should be forwarded to the subscriber; "Only 
Taking Important Calls" indicates that only calls from 
contacts that are designated as high priority should be 
forwarded; "Unavailable" indicates that no calls should be 
forwarded; and "Running on Schedule" indicates that the 
25 override schedule is being cancelled in favor of the default 
schedule. 

In the illustrated example, the subscriber reponds 
by saying "Taking Calls." Next, the electronic assistant 
asks "Where?" To this the subscriber may respond with one 
30 of the following designations: work, home, car, mobile or 

other, for each of which it is assumed that a correspodning 
address or phone number exists. 
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After the subscriber has indicated how he will be 
reachable, the electronic assistant prompts him to indicate 
for how many hours to which the subscriber responds with a 
number. After the subscriber has indicated the time, the 
5 electronic assistant acknowledges his response by saying 
"Done". 

The "Create-An-Itinerary" command, a sample dialog 
of which is shown in Fig. 34, operates in a similar manner 
excpet that it allows the subscriber to use voice commands 
10 to build as large a override schedule as is desired. In 

other words, after the first entry has been completed, the 
electronic assistant asks "And then you'll be". in response 
the subscriber can enter more schedule information or can 
terminate the command routine by saying "Back on schedule." 

Calling Commands: 

A subscriber can ask his electronic assistant to 
place a phone call in one of three ways. If the party to be 
called is a known contact, the subscriber can identify the 
contact's name and the electronic assistant will obtain the 
information necessary to place the call from the 
subscriber's contact list- If the party is not a known 
contact, the subscriber can identify the party and provide, 
either by voice or by DTMF, a phone number to call. The 
third way is by relying on information available from 
context. That is, if the electronic assistant has just 
retrieved a voice mail message for the subscriber, the 
subscriber can say "Give-Them-A-Call" and the electronic 
assistant will call the party that left the voice mail 
message. Additionally you could say "Call" "This-One". 
This is possible because the stored message identifies the 
caller either as a contact or by a telephone that the caller 
was asked to leave. 
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The stored objects for the contacts also include a 
pointer to a note (e.g. a voice message) which the 
subscriber can generate and attach to the object. When the 
subscriber instructs his electronic assistant to call the 
5 contact, the electronic assistant plays the stored message 
to the subscriber while it is attempting to establish the 
connection with the contact. The note might include 
information about the contact which the subscriber wishes to 
be reminded of whenever he calls that contact. For example, 
10 he may wish to know the name of contact's secretary so that 
he can address her by name if she answers the phone. 

Request Connection: 

It is possible for a subscriber to infer to his 
assistant that he would like to talk with another subscriber 

15 without actually placing a call to her. This is referred to 
as requesting a connection with another user. One mechanism 
for accomplishing this is for the subscriber to explicitly 
request a connection. This causes the subscriber's 
assistant to inform the electronic assistant to whom the 

2 0 request is being made that a connection at some point is 
desired. When this request is received, the receiving 
assistant holds onto the request until it is communicating 
with its subscriber. During the next session with the 
subscriber, the receiving assistant informs its subscriber 

25 of the requested connection and the availability of the 

party requesting the connection. The subscriber can then 
decide to connect with the other party or ignore the 
request . 



30 



Feature Phone: 

The feature phone is an object within the system 
that builds smart-phone like functionality on top of the 
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conference object. The feature phone allows the user to 
manage multiple calls. The user can request that their 
assistant place different callers on hold, hang up on 
certain callers, and place calls to other contacts through 
5 the feature phone. When a subscriber receives a call using 
the system's smart call-waiting, the call is managed by the 
feature phone. Current implemented functionality includes 
the user commands: call, hang-up, and press- the-hold button. 
These commands are used to manipulate an unlimited number of 
10 simultaneous calls. 

Virtual Hallway: 

As indicated above, the subscriber can ask his 
electronic assistant what is in the virtual office, what it 
is holding in its hand or pointing to, what is in the 

15 "trash", what new messages have come in, etc. However, a 
concept of a virtual hallway is also supported in the 
described embodiment. The virtual hallway is made up of the 
collection of virtual offices. The subscriber can ask his 
electronic assistant what other subscribers are presently 

20 communicating with their electronic assistants by using a 
"Who-Else-Is-Around" command. This feature can be 
"filtered" so that the subscriber will only see people who 
are in the subscriber's contact list, or who are members of 
certain specified groups, or who are part of a particular 
25 phone book. 

In the described embodiment, which supports internal 
switching, and given that the virtual hallway feature 
enables the subscriber to see other subscribers who are on 
the system, the system also offers the capability to connect 
and share information between distributed or travelling 
groups or subscribers more quickly than is possible by 
messaging. in addition, "visibility" in the hallway can 
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also include the ability to see what the other person is 
doing (e.g. on the phone, reading a message, etc.). 
Depending upon what the subscriber has told her assistant 
about her availability and accessibility, the electronic 
5 assistant will control how visible the subscriber is (or how 
visible she will be) in the virtual hallway. 

Implemented Assistants 

The described embodiment implements two types of 
assistants, namely, the electronic assistant described above 
10 and a reception assistant. The reception assistant is much 
simpler than the electronic assistant, its job is to answer 
a central phone number (for all of the subscribers on a 
particular system) and route the call to the assistant for a 
specific subscriber. This avoids using the DID line and 
15 allows one number to handle a large number of subscribers. 
The basic operation of this assistant is to ask for the 
extension of the subscriber being dialed and then start up 
that subscriber's assistant to handle the rest of the call. 
At this point the call can be handled in a similar manner to 
20 the answer call task previously described. in the described 
embodiment, it is assumed that only subscribers (not their 
contacts) use the reception assistant. As such, once the 
extension for the subscriber is entered the subscribers 
assistant immediately asks for the subscriber's passcode. 
25 Since the assistant can assume it is the subscriber it does 
not need to ask for the name of the person calling, as 
previously described. 

It is possible to also include electronic assistants 
with different "personalities", i.e., assistants whose 
functionality is tailored to the particular subscriber for 
whom that assistant will be providing services. For 
example, there could be an electronic assistant for customer 
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service representatives. That electronic assistant would 
implement a set of commands and tasks that are more 
appropriate to the role of a customer service representative 
or even a group of customer service representatives. it 
might handle an incoming call by asking the caller a 
sequence of questions designed to gather information 
relating to the particular customer service problem. That 
information might be used by the electronic assistant to 
determine the most appropriate person to receive the call 
and it might also be used to assist the customer 
representative in responding to the customers concerns by 
retrieving relevant stored data from a database. The 
electronic assistant might place the caller in a queue with 
other callers having customer service questions and then 
15 connect them as resources become available. 

In addition to "customizing" the personality and 
skills of a subscriber's electronic assistant other 
specialized electronic assistants can be added to the 
system. For instance, a system may have, or be able to 
connect with, a stockbroker assistant to check on particular 
investments . 

Other embodiments are within the following claims. 
What is claimed is: 



20 
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Call 225 

- Phone number 768 
Create a contact 222 
- - Penon 737 

-Place 752 

- Group 476 

- Update it 848 

Do me a favor J 62 

• Change the prompts 287 

• Train vocabulary 886 
Find 346 

• Contact 266 

- All the contacts 282 

- New messages 663 1 

• New messages from 6632 

- Messages 6371 

- Messages from 6372 

• Filed messages 363 

- Phone book listing 72S 

- Reminders 736 

- Tutorials 688 
Good-bye Wildfire 429 



It's me, Wildfire • 
I will be 4t2 

•Taking calls 822 

• Only caking important calls 664 

• Running on schedule 767 
•Unavailable 862 

Nevermind • 
Remind me 763 
Remand me to call 768 
Send a message 726 

-Send it 7482 
Send a page 7271 
Send a reply 7272 
Tell me 863 

- Where do you think I am? 9391 
• Who's on hold' 946 

- Who else is around? 934 
That'll be all for now ft 2 
What are my options? 0 
What'* it sayf 934 
Wildfire 94$ 



Manipulating data In hone' 

- Describe it 3481 
-FHe« 3482 

• First item 3463 

• Give them a call 462 

• Next item 648 

- Previous item 7481 

- Send a copy 722 

- Throw it away 842 

• Go back 422 
-Update it 848 

- What are you holding? 929 

• Where were we? 999 
Managing incoming calls 

- I'll take it 4 

• Take a message 8 

- Wildfcre 94S 

Press the hold button 7841 
Hang up 487 
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Claims : 

1. A method implemented by a computer-based 
electronic assistant to receive and manage incoming calls to 
a subscriber, said method comprising: 
5 receiving an incoming call to the subscriber from a 

caller; 

in response to receiving the incoming call, 
establishing a first connection between the electronic 
assistant and the caller; 
10 establishing a second connection between the 

electronic assistant and the subscriber; 

over the second connection, electronically notifying 
the subscriber of the incoming call; 

in response to receiving a call accept command from 
15 the subscriber over the second connection, linking the 

caller and the subscriber so that they may communicate with 
each other ; 

upon linking the subscriber to the caller, switching 
the electronic assistant to a background mode in which said 

20 electronic assistant continues to monitor the subscriber 
over the second connection while the subscriber is linked 
with the caller; and 

in response to receiving a summoning command, 
switching the electronic assistant into a foreground mode, 

25 wherein the electronic assistant when in its 

background mode responds to a first set of commands 
including at least the summoning command and when in its 
foreground mode responds to a second set of commands, said 
second set of commands being larger than said first set of 

3 0 commands. 
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2 . The method of claim 1 wherein said second set of 
commands includes commands that are unavailable in the first 
set including a command that causes the electronic assistant 
to terminate the first connection. 

5 3. The method of claim 1 wherein said second set of 

commands includes commands that are unavailable in the first 
set including a command that causes the electronic assistant 
to establish a connection between the subscriber and another 
party . 

10 4. The method of claim 1 wherein said second set of 

commands includes commands that are unavailable in the first 
set including a command that causes the electronic assistant 
to send a message to another party. 

5. A method of processing an electronic reminder 
15 that is addressed to a subscriber, said electronic reminder 

including subscriber-generated content and a specified time 

at which it is to be delivered to the subscriber, said 

method implemented by a computer-controlled electronic 

assistant and comprising: 
20 storing the electronic reminder in an electronic 

database that is accessible to the electronic assistant; 

when current time coincides with the specified time, 

detecting that the stored electronic reminder has become 

due; 

25 in response to detecting that the stored electronic 

reminder has become due, identifying a communications device 
through which the subscriber can be reached at the specified 
time ; 

establishing a connection to the communications 

30 device; 
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upon reaching an answering party through the 
communications device, electronically notifying the 
answering party that the call is intended for the 
subscriber; ~ 
5 electronically informing the answering party that 

the answering party may accept the call by issuing an accept 
reply; . 

if the call is accepted by the answering party, 
electronically delivering the contents of the electronic 
10 reminder to the answering party through said communications 
device. 

6, The method of claim 5 wherein the step of 
identifying the communications device comprises: 

in response to detecting that the stored electronic 
15 reminder has become due, recognizing that the addressee on 
the stored electronic reminder is the subscriber; 

executing a locator algorithm to determine the 
likely whereabouts of the subscriber at the specified time; 
and 

20 if the locator algorithm determines a location at 

which the subscriber is likely to be at the specified time, 
identifying a communications device that is associated with 

that location, 

wherein the communications device that is associated 
25 with that location is the communications device to which 
said connection is established. 

7. The method of claim 5 wherein the electronic 
assistant is electronically connected to a plurality of 
different communications media and wherein the step of 

30 establishing the connection to the communications device 
comprises: 
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selecting the most appropriate one of said plurality 
of different communications media for placing a call to the 
communications device; 

establishing a channel to a selected communications 

5 medium; and 

establishing a connection to the communications 
device through said channel. 

8 • A method of processing an electronic reminder 
that is addressed to a subscriber, said electronic reminder 
10 including subscriber-generated content and a specified time 
at which it is to be delivered to the subscriber, said 
method implemented by a computer-controlled electronic 
assistant and comprising: 

storing the electronic reminder in an electronic 
15 database accessible to the electronic assistant; 

when current time coincides with the specified time, 
detecting that the stored electronic reminder has become 
due; 

in response to detecting that the stored electronic 

2 0 reminder has become due, recognizing that the addressee on 

the stored electronic reminder is the subscriber; 

executing a locator algorithm to determine the 
likely whereabouts of the subscriber at the specified time; 

if the locator algorithm determines a location at 
25 which the subscriber is likely to be at the specified time, 
identifying a communications device that is associated with 

that location; 

establishing a channel to a communication medium 
that is appropriate for placing a call to the communications 

3 0 device; 

establishing a connection to the communications 
device through the channel; 
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upon reaching an answering party through the 
communications device, notifying the answering party that 
the call is intended for the subscriber; 

notifying the answering party that the answering 
5 party may accept the call by issuing an accept reply; 

if the call is accepted by the answering party, 
delivering the contents of the electronic reminder to the 
answering party. 

9. The computer-implemented method of claim 8 
10 further comprising: 

after delivering the contents of electronic reminder 
to the answering party, maintaining the connection to said 
communications device so that the subscriber can issue 
commands to the electronic assistant; 
15 monitoring the connection for subsequently 

subscriber-issued commands ; 

in response to receiving the subsequently 
subscriber-issued commands, performing other operations for 
the subscriber. 

2 0 10. A method implemented by a computer-based 

electronic assistant to receive and manage incoming calls to 
a subscriber, said method comprising: 

receiving an incoming call to the subscriber from a 

caller; 

25 in response to receiving the incoming call, 

establishing a first connection between the electronic 
assistant and the caller; 

through a dialogue between the electronic assistant 
and the caller over the first connection, determining the 

30 identity of the caller; 
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detecting that the subscriber is presently 
interacting with the electronic assistant through a second 
separate connection; 

electronically alerting the subscriber over the 
5 second connection that there is an incoming call for the 
subscriber; 

electronically identifying to the subscriber the 
identity of the caller; 

monitoring the second connection for a response sent 
10 by the subscriber to the electronic assistant directing the 
electronic assistant how to process the incoming call. 

11. A method implemented by a computer-based 
electronic assistant for managing information and connection 
resources for a plurality of subscribers including a first 
15 subscriber and a second subscriber, said method comprising: 

receiving a call from the first subscriber to the 
electronic assistant over a communications media; 

in response to receiving the call from the 
subscriber, establishing a first connection between the 
20 electronic assistant and the first subscriber; 

identifying the first subscriber as the source of 
the call; 

starting up a first session in said electronic 
assistant, said first session being a thread of execution of 
25 code for managing data and performing functions on behalf of 
said first subscriber; 

within said first session, receiving a first command 
sent by the first subscriber to the electronic assistant 
over the first connection, said first command instructing 
3 0 the electronic assistant to perform a function relating to 
the second subscriber; 
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within said first session, responding to said first 
command by sending a first message addressed to the second 
subscriber, said first message containing information 
relating to the first command; 
5 in response to said first message, starting up a 

second session, said second session being a thread of 
execution of code for managing data and performing functions 
on behalf of said second subscriber, said second session 
being separate from said first session; 
10 within said second session, receiving the first 

message ; and 

within said second session and in response to 
receiving said first message, performing a function that 
produces a result that is responsive to the first message. 

15 12. The method of claim 11 further comprising from 

within said second session, sending a reply message to the 
first session reporting the result. 

13. The method of claim 12 further comprising 
within said first session, receiving the reply message from 

20 the second session and in response thereto executing some 
function which uses information in said reply message and 
that is responsive to said first command. 

14 . A computer-implemented method of processing 
communications through a multimedia interface that includes 

2 5 a plurality of interface devices and a plurality of 

input/output devices, each of said plurality of interface 
device being capable of connecting to a different one of a 
plurality of different communications networks, each of said 
plurality of input/output devices capable of processing a 
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different one of a plurality of media types, said method 
comprising: 

establishing a channel representing a physical 
connection to any selected one of the plurality of 
5 communications networks through said interface devices; 

attaching an appropriate subset of a plurality of 
ports to the channel, wherein each port of said plurality of 
ports represents a different one of said plurality of 
input/output devices and wherein the appropriate subset of 
10 said plurality of ports includes ports which correspond to 
input/output devices that are capable of connecting to the 
selected communications network; 

executing an operation that generates an item of 
information that is to be communicated through the 
15 multimedia interface to at least one of said plurality of 
communications networks ; 

retrieving from a memory a multi-media reference to 
said item of information, said multi-media reference to said 
item of information containing a plurality of references to 
20 said item of information, each of which refers to said item 
of information in a different one of a plurality of formats, 
each of said plurality of formats being for a different 
media type; 

passing said multimedia reference to the ports 

25 attached to said channel; 

in response to receiving said multi-media reference 
at the attached ports, retrieving the item of information 
- from memory in a particular one of the formats identified in 
the multi-media reference, wherein the step of retrieving is 
30 performed by one of the attached ports that is capable of 

processing the format of the retrieved item of information; 
and 
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input/output device for delivery over the connected 
communications network, the input /output device to which the 
retrieved item is passed for delivery being the input/output 
device that is associated with the attached port that 
retrieved the item of information from memory- 
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